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Each year, approximately one third of the world's crops are destroyed by plant pests, amounting to 
billions of dollars in crop losses in the united States alone. Plants are susceptible to diseases and damage 
caused by an enormous number of different types of organisms, including virus, bacteria, fungi, algae, 
parasitic plants, weeds, insects, arachnids, and nematodes. The potential losses are kept in check by 

5 natural controlling mechanisms, and when these systems fail, by applications of various types of insecti- 
cides which typically act by attaching one specific, genetically controlled aspect of the target organism's 
metabolism. However, the efficacy of any given pesticide may be limited by the appearance and spread of 
resistance to the pesticide among the target population. The appearance and spread of insecticide 
resistance in wild populations argues for a genetic origin. First, a resistant genotype or trait appears in a 

10 local population and then with continued insecticide use (and thus, disproportionate survival of individuals 
with this genotype or trait), the resistance rapidly increases in the population and via migration resistance 
may spread to regional and perhaps even worldwide populations. Resistance may arise as a genetic allele 
already present within a population, or it may arise de novo . Nonetheless, whatever the cause, in a 
population with a short generation time (which is characteristic of many insects), the resistance trait can 

75 spread rapidly and quickly render ineffective the planned pattern of pesticide application. 

The continued development of natural strategies for insect control could be enhanced by an under- 
standing of the genetic basis of the resistance in economically important pests. Such studies have been 
ongoing, particularly with regard to insect pests, and a great deal has been learned about the major types of 
resistance observed in insects. At least three types of insect resistance have been identified: decreased rate 

20 of uptake, increased rate or degradation and changes in the target site. To some extent, certain aspects of 
the genetic mechanisms of these types of resistance have been determined; however, knowledge of the 
specific genetic basis for resistance has not yet been effectively applied in the field to monitor the 
occurrence of resistance, or to assist in planning effective insecticide applications to avoid or alleviate the 
development of resistance. Modification of insecticide application patterns can be critical in cases in which 

25 resistant insects are otherwise less fit than non-resistant insects; application of insecticide to which some 
individuals are resistant in these cases may actually select for increase in resistance in the population, when 
it might otherwise have been maintained only at low levels or entirely eliminated from the population. Thus, 
a method for exploiting the available knowledge of the genetic basis for resistance is greatly needed. 

Some of the most destructive of insect pests are found among the order Lepidoptera. The damage 

30 caused by lepidopterans is most frequently related to feeding activity of their larvae (caterpillars) on plants. 
Of the lepidopteran plant pests, among the most damaging are those insects related to the genus Heliothis . 
Two species of the genus Heliothis , H. virescens (the tobacco budworm) and H. armigera (American 
bollworm), and Helicoverpa zea (the corn ear worm) are responsible for a tremendous amount of damage to 
tobacco, cotton, corn, beans, alfalfa, and solanaceous plants in the United States. Over the years these 

35 pests have been controlled by application of a variety of insecticides; however, H. virescens has regularly 
developed resistance to compounds from virtually every major insecticide class. As one exception, until 
fairly recently the pyrethyroid class of insecticides continued to effectively control Heliothis in the field. 
Unfortunately, it has recently been noted that pockets of tolerance or resistance are beginning to appear in 
Heliothis virescens populations in various areas in the United States and in H. armigera and H. punctigera 

40 abroad. Because pyrethroids represent the most effective control of these insects, it is essential that 
widespread occurrence and/or spread of resistance to pyrethroids be avoided. 

Resistance to pyrethroids has been extensively studied in a variety of dipterans, and a number of 
different patterns of inheritance and explanations for resistance have been suggested. However, the basis 
for pyrethroid resistance or tolerance in lepidopterans generally, and in Heliothis specifically, has not yet 

45 been clarified. An understanding of the genetic mechanism of resistance, or even a definable genetic 
marker for resistance, would provide a much-needed basis for tracking the resistance trait accurately in a 
population. The present invention now provides the necessary tools for monitoring the occurrence and 
spread of resistance in a population, in particular for pyrethroid resistance in lepidopteran populations. 

so SUMMARY OF THE INVENTION 

The present invention provides an isolated nucleic acid fragment encoding all or a portion of a non- 
dipteran sodium channel. This channel is believed to be target site for sensitivity to a variety of different 
insecticides, including pyrethroids, and is useful as a marker for such target-insensitive insecticide 
55 resistance. Preferably the fragment encodes a lepidopteran, coleopteran or homopteran sodium channel. 
Sodium channels from both resistant and sensitive strains are encompassed herein. The nucleic acid 
fragment provides the basis for probes useful in detecting the presence of the resistance trait in a 
population of insects to be evaluated. Also provided are vectors containing the resistance gene which may 
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be used to introduce a gene encoding insecticide resistance into beneficial insects, such as honey bees. 
The invention also provides the isolated protein or fragment encoded thereby, as well as biologically or 
immunologically active fragments thereof, which protein or fragments are useful in generation of polyclonal 
and monoclonal antibodies. Such antibodies can be used to detect the presence of sensitive or insensitive 

5 sodium channels. In a preferred embodiment, the insecticide target is a Heliothis sodium channel. 

The invention also provides a means for monitoring, both quantitatively and qualitatively, the level of 
resistance in any given pesticide target population. The presence or absence of a resistance trait is 
determined by hybridizing whole genomic DNA, cDNA or one or more restriction fragments from one or 
more individuals from the population with a nucleic acid probe based on the sequence of a nucleic acid 

10 encoding a pesticide target site. Quantification of the trait is further obtained by calculating the number of 
the individuals having resistance relative to the number of sensitive individuals, and calculating the 
percentage occurrence of resistance. This in turn permits the observer to determine whether or not the 
contemplated pesticide application will be effective, whether alternate treatment may be required, or to 
predict when, at some time in the future, alternate treatment may be needed. In an alternate embodiment, 

15 the DNA can be used to express a recombinant protein or peptide, which in turn can be used to raise 
monoclonal antisera. Preferably antisera that can specify or identify both resistant and sensitive targets are 
raised. Such monoclonal antibodies may then be utilized in routine immunological procedures to determine 
the presence or absence of the resistant protein in a population. 

The present invention also provides the basis for an in vitro screen which will detect potential 

20 insecticidal activity. A nucleic acid sequence encoding a lepidopteran sodium channel can be inserted into 
a convenient host cell and a battery of potential insecticides tested for their ability to interfere with 
expression of either the gene or the encoded protein. 

BRIEF DESCRIPTION OF THE FIGURES 

25 

Figure 1 illustrates the nucleotide and amino acid sequences of the Heliothis clone hscpl, in 
comparison with the nucleotide and amino acid sequence of the para locus (sodium channel) of Drosophila 
melanogaster . "Dm" = Drosophila sequence; "scd" = portions of the Heliothis sequence; the numbers 
after "scd" refer to various subclones used to determine the sequence. The underlined amino acid 

30 sequences are membrane-spanning domains of the sodium channel. Superimposed above the sequences 
are the specific sequences of various primers (e.g. HSC 3455 + ) used in cloning and/or sequencing 
procedures. Numbering is based on the Drosophila homologue sequence to the Heliothis sodium channel. 

Figure 2 shows Restriction Fragment Length Polymorphisms (RFLPs) developed utilizing a labelled 
hcspl DNA sequence as a probe. "RR" identifies DNA derived from resistant individuals and "SS" refers to 

35 DNA derived from sensitive individuals. The presence or absence or resistant and sensitive individuals is 
made by the vial test described by Campanhola and Plapp, J. Econ. Entomol. , 82:1577-1533, 1989. 
Protocols for the procedure are described in Example 3. 

DETAILED DESCRIPTION OF THE INVENTION 

40 

As described in detail in the following Examples, the Heliothis sodium channel is isolated by 
amplification of Heliothis genomic DNA from an inbred susceptible strain using degenerate primers 
homologous to a portion of a sodium channel gene from Drosophila melanogaster (Loughney et aJL Cell 
58:1143-1154, 1989), as described in Example 2. A 184 bp amplification product is obtained which, upon 

45 sequencing, is found to encode an identical amino acid sequence when compared to the same region in the 
Drosophila gene. This PCR product is then labelled and hybridized to restriction enzyme-digested Heliothis 
genomic DNA. The highest molecular weight DNA fragment identified is from an EcoRI digest. 

Genomic DNA is then isolated from a resistant Heliothis strain and digested to completion with EcoRI. A 
genomic library is constructed in a g Zap II vector, and a labelled 184 bp fragment is then used to screen 

50 this library. One positive plaque yields a genomic clone of approximately 8000 bp which is referred to as 
n hscp1. w This clone shows significant homology to the published Drosophila sequence (Figure 1). 

Based on the hspc 1 sequence, a pair of primers designated 41 16 + , and 4399- (as depicted in Figure 
1) are used to amplify fragments of the sodium channel gene from both resistant and susceptible Heliothis 
individuals. Fragments are digested with either Rsal, Sau3AI or Msel. The restriction fragments are then 

55 separated and analyzed by gel electrophoresis. The resulting Restriction Fragment Length Polymorphisms 
(RFLPs) show distinct patterns unique to resistant and susceptible individuals. This demonstrates the utility 
of a nucleic acid sequence for defining genetic RFLP patterns useful for identifying resistant individuals 
within a population (Figure 2). 
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By homology with the known nucleic acid sequence for a Drosophila sodium channel, it is presumed 
that the isolated Heliothis sequence represents a portion of the corresponding Heliothis channel. Also, by 
comparison with the available information regarding the Drosophila channel as being the target site of 
pyrethroid action, it is reasonable to extrapolate this function in Heliothis as well. However, whether or not 

5 the isolated sequence represents the target site, or a genetic locus that is tightly linked with resistance, the 
RFLP results described above show that difference in the DNA is a reliable marker for identifying 
differences in susceptibility to insecticides that primarily target the sodium channel, particularly pyrethroids 
(but also chlorinated hydrocarbons and venom components such as the toxin derived from Androctonus 
australis [Aalt], saxitoxin, tetrodotoxin and the like) in an insect population. 

10 The isolation of the DNA sequence encoding the Heliothis sodium channel provides a number of 
advantages. First, in view of the unexpected high level of homology between Drosophila and Heliothis 
sodium channels, it must be assumed that channels of other lepidopteran species have similar or even 
higher homology to the Heliothis sodium channel. Thus, the Heliothis sodium channel DNA provides the 
basis for isolation of other lepidopteran channels. Such lepidopteran channels can be readily isolated by 

75 hybridization under medium (e.g., 1xSSC, 0.1% SDS, 55°C) or high (0.1 x SSC, 0.1% SDS, 65°C) 
stringency conditions using the Heliothis DNA or portion thereof, to function as an identifiable probe when 
screened against cDNA or whole genomic libraries from the species of interest. Isolation of DNA hybridizing 
under said conditions can be achieved by standard techniques. Lepidopteran species of interest include, 
but are not limited to: other Heliothis species, such as the American bollworm, H. armigera and the 

20 bollworm, H. punctigera ; lepidopteran species of the genus Spodoptera , e.g., the Egyptian cotton leafworm, 
S. litteralis , the beet armyworm, S. exigua ; the fall armyworm, S. frugiperda ; the cutworm, S. litura , the rice 
swarming caterpillar, S. mauritania and the Southerm armyworm, S. eridania ; and other miscellaneous 
leptdopterans, e.g., the pink bollworm, Pectinophora gossypiella ; the spiny bollworm, Earius insulana, the 
cotton leafworm, Alabama argillacea ; the leaf perforator, Bucculatrix thurberiella ; the tomato fruitworm, 

25 Helicoverpa zea ; the diamondback moth, Plutella xylostella ; the cabbage looper, Trichoplasia ni; the 
imported cabbageworm, Artogeia rapae ; the imported cabbageworms Hellula undalis and Hellula rogatatis ; 
the black cutworm, Agrotis ipsilon ; the corn earworm, Ostrinia nubalis ; the tomato pinworm, Keiferea 
lycopersicella ; the tomato hornworm, Manduca sexta and Manduca guinquemaculata ; the velvet bean 
caterpillar, Anticarsia gemmatalis ; the green oliveworm, Plathypena scabra ; the soybean looper, 

30 Pseudoplusia includens ; the saltmarsh caterpillar, Estigmene acrea ; the leaf miner, Epinotia meritana ; the 
codling moth, Cydia pomonella ; the oblique banded leaf roller, Choristoneura rosaceana ; grape berry moth, 
Lobesia botrans ; currant tortrix, Pandemis cerasana ; spotted tentiform leaf miner, Phy Monocytes blancardella ; 
grape leafroller Sparganothis pilleriana ; tufted bud apple moth, Platynota idacusalis ; red banded leafroller, 
Argyrotaenea velutinana ; oriental fruit moth, Grapholitha molesta ; Southwestern corn borer, Diatraea gran- 

35 diosella ; rice leafrollers, Cnaphalocrocis medinalis , M arasmia exigua and Marasmia patnalis ; striped borer, 
Chilo suppressalis ; dark headed stem-borer, Chilo polychrysis ; yellow stem borer, Scirophaga incatulas ; 
white stem borer, Scirophaga innotata ; and pink stem borer, Sesamia inferens . 

The isolated Heliothis nucleic acid fragment is also useful in other regards. The newly observed 
homology between Drosophila and Heliothis sodium channels predicts not only substantial homologies 

40 between Heliothis channels and other lepidopteran species, but also between Heliothis and other non- 
lepidopteran insect channels. Thus, the fragment, or portions thereof, can be utilized in developing RFLP's 
for other lepidopteran species, including, but not limited to, e.g., the lepidopteran species noted above, as 
well as non-lepidopteran species such as as the Colorado potato beetle Leptinotarsa decimlineator , the boll 
weevil, Anthonomus grandis ; the Southern corn rootworm, Diabrotica undecimpunctata ; the Japanese 

45 beetle, Popillia japonica ; plum curculio, Conotrachelus nenuphar ; brown planthopper, Nilaparvata lugens ; 
green leaf hopper, Nephotettix virescens ; potato leaf hopper, Empoasca abrupta; cotton aphid, Aphis gos- 
sypii ; green peach aphid, Myzus persicae ; sweetpotato whitefly, Bemisia tabaci ; imported fireant, 
Solenopsis invicta ; thrips, e.g., Thrips palini ; pear psylla, Psylla pyri ; two-spotted spider mite, Tetranychus 
urticae ; carmine mite, Tetranychus cinnabarinus ; citrus rust mite, Phyllocoptruta oleivora ; German cock- 
so roach, Blatella germanica ; cat flea, Ctenocephatides felis ; yellow fever mosquito, Aedes aegypti ; and salt 
marsh mosquito, Aedes sollicitans . The generation of useful RFLPs for these species is achieved in 
substantially the same manner as described herein for Heliothis . 

The Heliothis nucleic acid fragment or portions thereof can also be used as a probe, or can be used as 
the basis for designing degenerate probes, to screen genomic or cDNA libraries derived from such other 

55 non-lepidopteran insect species for specific sodium channels from these species. However, given the herein 
demonstrated high level of homology between the distantly related Drosophila and Heliothis , it is quite likely 
that the present Heliothis virescens fragment can be used directly as a probe for identifying resistant 
sodium channels by RFLPs for other lepidopteran and nonlepidopteran species, without the need for 
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isolation of those species' specific sodium channel DNA fragments. 

Continued monitoring and early detection of the presence of a resistance trait in any population is 
essential to effective insect control. By the time resistance is apparent at the gross level, it is very likely 
already at a point where further treatment with the pesticide is doomed to failure. For example, application 

5 of pyrethroids to a population in which resistance is already established will substantially increase the 
selection pressure favoring the appearance of the resistance trait. Whereas, in the absence of such 
selection, the resistant individuals are reproductively less fit than sensitive (wild-type) individuals. Hence, 
resistance would not otherwise have become established in the population without the application of 
insecticides. Thus, selective and timely application of pesticides or recognition of need for alternative 

w application of pesticides at an early stage can be critical in maintaining suitably sensitive insect populations. 
The identification of a genetic trait associated with resistance provides several avenues for tests to 
monitor the occurrence and frequency of resistance in a population at a very early stage, when frequency 
may be low and/or undetectable by standard bioassays. Early observance permits for informed judgments 
in the application of the relevant pesticide. For example, the gene encoding the resistant sodium channel 

75 provides the basis for informative southern or RFLP analysis of an insect population to identify the presence 
of the resistance trait in a given population. Detection of the unique DNA associated with a resistance allele 
(or the presence of two distinct alleles) therefore is diagnostic for the presence of the resistance trait in an 
analyzed sample. This may be determined, for example, by digesting genomic DNA collected from 
individuals of the target population in question and probing a Southern blot with detectably labelled DNA 

20 sequence that identifies a particular resistance trait, or a diagnostic portion thereof, to identify the presence 
or absence of the resistance allele. By "diagnostic portion" thereof is meant any fragment of the hscpl 
DNA which differs sufficiently in sequence from the corresponding portion of the susceptible DNA 
sequence, or a unique DNA sequence genetically linked to the trait, so as to assure its hybridization, under 
high stringency conditions, only with DNA encoding the resistance trait. It should be noted that sequences 

25 flanking the resistance gene, as well as intervening sequences (introns) are particularly suited for identifying 
unique diagnostic RFLPs. 

RFLP analysis also provides an attractive method of analyzing the existence and frequency of the 
resistance trait in the population. As the Examples below show, there is a detectable polymorphism 
associated with the sodium channel DNA between resistant and susceptible individuals. Thus, target 

30 population DNA can be analyzed for the presence of polymorphisms using the detectably labelled cloned 
hscpl DNA as a probe. In this technique, DNA from several individuals in the target population is digested 
with an appropriate restriction enzyme, and size separated by gel electrophoresis. The gel, or a blot derived 
therefrom, is then probed with labelled DNA, either the whole gene or fragment. If there are both resistant 
and sensitive alleles within an individual in the population, there will appear on the gel two different sized 

35 restriction fragments, each of which will hybridize with the hscpt probe. In this manner, large numbers of 
individuals in the population can be sampled, and the relative abundance of the allele can be determined. 
Identification of the specific DNA fragment associated with resistance, whether by Southern or RFLP 
analysis, will always be diagnostic. 

In this regard, the present invention also provides a kit for evaluating the extent to which a resistance 

40 gene is present in a given population. The kit will contain as its principle components (1) a restriction 
enzyme for digesting DNA, and (2) a detectably labelled probe comprising a nucleic acid fragment capable 
of hybridizing specifically with DNA encoding the resistance trait, or a nucleic acid fragment capable of 
hybridizing with the diagnostic RFLP marker. In a preferred embodiment, the kit also comprises (3) a means 
for extracting DNA from cells of the target population, and/or (4) PCR primers useful in amplifying the target 

45 DNA sequences. Also included may be a set of reference standards comprising sensitive and resistant 
DNA. 

As a specific example, a kit for the detection of altered sodium channels in a population would include 
(1) a restriction enzyme such as Tagl or EcoRI, which will generate fragments which show the relevant 
polymorphism, if present (2) a radioisotope-or biotin- labelled DNA comprising the sequence of the sodium 

so channel or fragments thereof; and optionally (3) a DNA extraction means. 

It will be recognized by those skilled in the art that variations or components (1) and (2) in particular are 
contemplated. Any restriction enzyme which produces a detectable polymorphism can be used. Preferably, 
the enzyme used will be a 4-cutter, such as Sau96l, ScrFI, Sau3A1, Rsal, Msel, Mspl, Mbol, Hpall, HinPI, 
Haelll, Dpnll, BstVI, and Bfal; or a 6-cutter, such as EcoRI, BamHI, Hindlll, Pstl, and Sail; less useful are 8- 

55 cutters, such as Notl, Stol, Pad, Sse36l, AscI, Fsel, Pmel, Rsrll, or SwaL The utility of any given restriction 
enzyme can readily be determined by digesting DNA known to contain alleles for both resistance and 
sensitivity with the candidate enzyme, and observing the presence or absence of a polymorphism by 
probing with hscpl, or any DNA linked to this region. Also, it will be understood that the "detectably 
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labelled" DNA may alternately be labelled so as to be detectable in any manner known in the art, e.g., by 
chemiluminescence, bioluminescencse, ELISA, biotinavidin, or any other appropriate means. The foregoing 
scheme is useful for detecting the presence of resistance to not only pyrethroids, but also DDT and 
arthropod toxins, such as the sodium channel toxin derived from Androctonus australis (AalT). 

5 Those skilled in the art will also recognize that the approach to resistant pest management described 
herein is not limited solely to control of resistance based on an altered sodium channel. Utilizing target site 
DNA as a means of tracking the presence of resistance in a population provides a far more precise and 
sensitive measure of the prevalence of resistance than do previously utilized methods. The target sites for 
many types of pesticides are now known, and therefore, the proposed genetic analysis for a resistance trait 

to can be applied to other insecticides as well. For example, acetylcholinesterase is known to be the target 
site for carbamate and organophosphate insecticides (Oakeshott et a].. PNAS USA 84:3359-3363, 1987). 
Organophosphate insecticides include malathion, methylparathion, diazinon, turbophos and dicrotophos; 
carbamates include sevin, Aldicarb, methionyl and thiodicarb. Target site resistance to some of these 
insecticides has been reported (Karunaratre et aL, Resist. Pest. Manag. Newsletter, 3:11-13, 1991; Chen, 

15 Resist. Pest Manag. Newsletter, 2:15, 1990). The acetylcholinesterase gene has been cloned (Fournier et 
aL, J. Mol. Biol. 210 :15-22, 1989), providing the basis for development of an analogous detection system for 
this type of resistance . Monooxygenase and mixed function oxidases (MFOs) have also been shown to be 
involved in resistance by increase in the rate of metabolism of organophosphates and carbamates 
(Brattstein et aL, Science, 1961349-1352, 1977; Brattstein et aL, Pesticide Biochem. Physiol., 3:393, 1973, 

20 Krieger et aL, science, 172:579, 1971; Matsumara, Toxicology Insecticides, Plenum Press, New York, 1975). 
Cyclodienes have been shown to act at the GABA receptor (Kadous et aL, Pestic. Biochem. Physiol. 
19:157-166, 1983; Tanaka et aL, Pestic. Biochem. Physiol. , 22:117-127, 1984); and target site resistance is 
known to exist (ffrench-Constant et aL, J. Econ. Entomol. 83:1733- 1737, 1990) and the receptor gene has 
been cloned (french-Constant et aL, PNAS USA, 88:7209-7213, 1991). Similarly, methoprene and certain 

25 botanical extracts (Precocenes) target the juvenile hormone (JH) receptor and resistance to these insecti- 
cides has been reported (Wilson et aL, Devel. BioL, 118:190-201, 1986; Georghiou et aL, J. Econ. EntomoL, 
71:544-547, 1978; Dyte, Nature, 238:48-49, 1972). Bacillus thuringiensis (Bt) toxins affect a gut associated 
glycoprotein but resistance has not become widespread. Diacyl hydrazine and certain botanical extracts 
(Penosterone A) target the ecdysone receptor (Wing, Science, 241:467-469, 1988; Spindler-Barth et aL, 

30 Arch. Ins. Biochem. and Phys., 16:11-18, 1991; Cherbas et aL, PNAS USA, 85:2096-2100, 1988) and the 
genes for the ecdysone receptor have also been cloned (Yao et aL, Cell, 71^:63-72, 1992; Koelle et aL, Cell, 
67:59-77, 1991). ~~ 

The use of this method is also not limited to detection of insecticide resistance, but may be applied to 
any other pesticides, including herbicides, acaricides, fungicides, nematicides, and molluscicides. A number 

35 of genes conferring resistance to herbicides have been characterized. For example, altered acetohydroxy 
acid synthase genes are the basis of resistance to sulfonylureas and imidazolinone herbicides (EP 
Application No. 91 119 254.0; Yadav et aL, PNAS USA 83:4418-4422, 1986). Glyphosate targets the enzyme 
5-enolpyruvate shikimate-3-phosphoric acid synthase, and mutant genes encoding resistant forms of this 
enzymes have been identified (Comai et aL, J. BioL Chem. , 260:4724-4728,1 985). Similarly, genes 

40 conferring resistance to the herbicides phosphothrinicin and bialyphos have also been characterized 
(Thompson et aL, EMBQ J, 6:2519-2523, 1987; DasSarma et aL, Science, 232:1242-1244, 1986). 

The target site of various fungicides is also known. For example, phenylamide fungicides, such as 
acylalanines (metalaxyl, furaxyl and bevalaxyl), butrolactones (ofurase, cyprofuran), and oxazolidinones 
(oxadixyl) are known to act on fungal RNA polymerase (Arp et aL, Fungizider. Mitt. BioL Bundesanst 236- 

45 237, 1981; Davidse, Neth. J. Plant Pathol. 87:11-24, 1981; EPPO Bull 15:403-409, 1985). Resistance to 
these fungicides has also been reported (Davidse et aL, J. Plant Pathol., 87:65-68, 1981; Davidse et aL, 
Experiment. Mycology, 7:344-361, 1983). The fungicide carboxin is known to have as a target site succinate 
dehydrogenase (Schewe et aL, in Modern Selective Fungicides, H. Lyr, ed. V.E.B. Gustan Fischer Vertag. 
Jene, 1987). Resistance and cloning of the resistance gene have also been reported (Keon et aL, Current 

50 Genetics, 19:475-481, 1991). The blasticidin fungicides, such as BlaS and Blasticidin S act on the enzyme 
nucleoside aminohydrolase; resistance has been observed and the gene conferring the resistance has been 
cloned (Kamakura et aL, Mol. Gen. Genet. 223:169-179, 1990; Kamakura et aL, Agric. BioL Chem., 51:3165- 
3168, 1987). The benzamidazole fungicides, such as benamyl, carbendazim, mocodazole and thiabenazole, 
act by affecting with microtubule function (Clemons et aL, Pesticide Biochem. Physiol., 1:32-43, 1971; 

55 Hammersdag et aL, Pesticide Biochem. Physiol., 3:42-54, 1973). Resistance is also known to occur to these 
fungicides (Van Tuyl, Med. Fac. Lonbouww Ryksuniv. Gent., 40:691-698, 1975); Meded. Landb. Hogesch. 
Wageningen, 77:1-137, 1977); Fanetran et aL, Mycol. Res., 95:943-951, 1991). The relevant resistance gene 
has been isolated and cloned (Jang et aL, Cell Motility and the Cytoskeleton, 17:87-94, 19906; Orbach et 
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al., Mol. Cell Biol., 6:2452-2461, 1986). 

Other applications of this method will be apparent to those skilled in the art, in view of the following 
non-limiting examples. 

5 EXAMPLES 

1 . DNA Preparation 

Genomic DNA is prepared from adults of an inbred American Cyanamid Company susceptible strain of 
10 Heiiotnis virescens as follows. A moth is placed in 400 ml of grinding buffer (0.1 M Tris-HCI, pH 9.0, 0.1 M 
EDTA, 1% SDS) and homogenized with a pestle. 80 ml of 5M KOAc and 400 ml equilibrated phenol is 
added; the sample is inverted several times and left to stand on ice for five minutes. Two hundred ml of ice 
cold chloroform is added, spun at 15,000 x g for five minutes, and supernatant removed. The procedure is 
repeated at least once. 

is Four hundred ul chloroform is added to the pellet, the sample inverted for 30 seconds and then spun 
for 5 minutes at 15,000 x g. The chloroform is removed, the sample spun again for one minute and the 
remaining chloroform removed. Two volumes of cold ethanol are added to the aqueous phase, and the 
sample left to stand five minutes at room temperature. The sample is once again spun for five minutes, the 
supernatant aspirated, and the pellet dried. The dried pellet is resuspended in 50 ul Tris-EDTA (10mM 

20 TRIS, 1mM EDTA, pH 8.0). 

2. Isolation of Channel Fragment from Genomic DNA 

The isolated genomic DNA is used as a template in PCR with primers based on portions of the 
25 Drosphila melanogaster para locus sodium channel. 

Specifically, degenerate primers homologous to portions of an exon in the fourth transmembrane 
domain of the a-subunit of the Drosophila para locus are constructed as follows: 



para 4991+ 5' (73) GAAATCACTCCCAATTA ATH GAR AAR TAY TTY CT 3' 
para 5143- 5' (H13-40) TTTCCCAGTCACGAC ATN GCR AAD ATR AAC AT 3' 



35 where H = A, C or T, R = A, G or T, Y = C or T, and N = any base. Numbers refer to 3' terminal base 
positions in the para sequence. Underlined sequences are universal primer tails T3 and M13 -40 
respectively used for sequencing of product. 

PCR reactions of 100 ul are constructed of approximately 1 mg of genomic DNA, 1 mg of each primer, 
0.2 mM of each dNTP, 10 mM Tris pH 8.3, 50 mM KCI, 2mM MgCfe. 0.001% gelatin, and 2 U of Tag 

40 polymerase. Reactions are incubated for 5 cycles, each of 50 seconds at 94°C, 2 minutes at an annealing 
temperature of 53° C, and 25 seconds at 72°C, then for 35 cycles with an annealing temperature of 45°C. An 
amplification product of 184 base pairs is obtained, and then directly sequenced using the Sequenase kit 
(United States Biochemical Co.) according to the manufacturers directions- The deduced amino acid 
sequence is found to be the same as for an equivalent region in para. 

45 Genomic DNA is also digested with several restriction enzymes, specifically EcoRI, BamHI, Sail, Hindlll, 
Pstl, and Xbal. The fragments are separated on agarose gel and transferred to a nylon support. The PCR 
product described above is radiolabeled and hybridized to the nylon blot at 60°C overnight. The blot is 
washed with a wash buffer (IMNaPi, 250 mM EDTA, pH8, 20% SDS; Napi = Na 2 HP0 7H 2 0, 134g and 
H3PO4 to pH7.2/liter) at 60°C three times for thirty minutes each. The filter is exposed to film. The film is 

50 developed after 12-24 hours of exposure at -80°C. The results show single bands in each lane indicative of 
a single copy gene. The largest band is for the EcoRI digest. 

Based on the foregoing information genomic DNA is prepared from an ICI America's pyrethroid 
resistant PEG-87 H. virescens strain using cesium chloride purification as described by Ausubel et al. 
(Current Protocols in Molecular Biology, Green Publ. Assn. and Wiley Interscience, 1989), and digested to 

55 completion with EcoRI. This DNA is used to construct a genomic library in the Lambda-Zapll vector 
(Stratagene Co., LaJolla, CA) following manufacturers* instructions. The 184 bp PCR fragment is used to 
screen this library by hybridization as described in standard Lambda-Zap II protocols. Several positive 
plaques are purified and the inserts excised in vitro following manufactuer's instructions, and subsequently 
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characterized. A genomic clone designated "hscpl" has approximately 8000 bp, and is extensively 
sequenced. For this first 990 base pairs of coding sequence, there is significant homology between hscpl 
and the published para sequence of Drosophila (Loughney et aL, Cell , 58:1143-1154, 1989). 

3. RFLP Analysis 

Fragments of the gene from individuals of both ICI- pyrethroid-resistant lines and American Cyanamid 
Company susceptible strains (collected Stoneville, Mississippi, 1963) are amplified by PCR using several 
pairs of primers based on the available hscpl sequence. In this specific example, hscp4116+ and 
hscp4399- are used. The PCR reactions, of 100 ml, consist of 100 ng-1mg of genomic DNA, 100 ng each of 
primer (hscp 41 16 + , 4399-, as shown in Figure 1) and other components as described above. Negative and 
positive control reactions are also made respectively, without template DNA or with hscpl DNA. 

Reactions are incubated for 30 cycles, each of 50 seconds at 94°C, 2 minutes at an annealing 
temperature of 56°C, and 1.5 minutes at 72°C. PCR products are purified with phenol, chloroform and 
precipitated using ammonium acetate-ETOH. PCR products are then apportioned among three different 
restriction enzyme reactions mixes following manufacturers' instructions (Rsal, Sau3AI, and Msel, New 
England Biolabs, Beverly MA), and incubated at 37°C overnight. Digestion products are resolved on a 3% 
"NuSieve" (FMC) agarose gel and stained with ethidium bromide at about 50ng/ml. The resulting restriction 
fragments length polymorphisms show a distinct pattern for each of the resistant and susceptible strains 
(Fig. 2), indicating the utility of this method in detecting the presence of resistant individuals among a 
generally susceptible population. 

DEPOSIT OF BIOLOGICAL MATERIALS 

The following materials have been deposited with the American Type Culture Collection, 12301 
Parklawn Drive, Rockville, MD, on October 19, 1992 and have been given the following accession numbers. 



Deposit 


Accession No. 


Sodium channel para homolog (3' half of gene) from 
Heliothis virescens ICI strain PEG-87 (hscpl) 


ATCC 75334 
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15 



FIGURE 1 



Hdiothb and Dnxophila sodium channels. *** start /end of my sequences, _ gap, * same as above. 3/12/92 pi . 

para ATGACAGJJ^GATTCCGACTCGATAT^ 

I + — ♦ ♦ + + ♦ 7S 

MTEDSDSISEEERSLFRPFTRESLV 

Dm CAAATCC^CAACGCATTGCCGCTGAAC^^ 

76 * ♦ — ♦ + + — * + » 150 

QIEQRIAAEHEKQKELERKRAECEV 

Chi CCGCGATATGGTCGCAAGAAAAAACAAAAAGAAATCCGATATGA 

151 * * ♦ + ♦ ♦ * 225 

PRYGRKKKQKEIRYPDEDEDECFQ? 
Aintron A /\B 

Dm GATCCTACACTTGAACAOGGTGTGCCAATACC^ ATTGGCCTCCACT 

226 + + + + ♦ + * ♦ 200 

DPTLEQGVPIPVRLQGSFPPELAST 

Dm CCTCTCGAGGATATCGATCCCTACTACAGCAATGT 

20 301 + + * + + ♦ * 375 

PLEDIDPYYSNVLTFVVVSKGKDI F 

AC 

Cln CGCTITTCTGCATCAAMGCMTCT 

376 + ♦ + + + — ♦ + + 450 

RFSASKAMWHLDPFNPIRRVAI Y I 

25 Cm GTGCATCCATTATTTTCCCTATTCATCATCAC^ 

451 „ + * + + + + 525 

VHPLFSLFI ITTI LVNCTI. MIM P T T 
I -SI 

Dm (TCACGGTTGAGTCX^CTGAGGT^ 

526 + ♦ * * + + 4- + 600 

30 P T V £ S TEVIFTGT YTFESAVKVMA R 

I-S2 

Dm GGTTTCATmATGCCCGTTTACGTATCTT 

60 l ♦ + ♦ + + + # 675 

CFILCPFTYLRD AWNWL DFVVI ALA 

IS3 /\D 

35 

Dm TATGTGA<XATGGCTATAGATTTAGGTAATCTA^ 

676 ♦ ♦ + + + + 750 

Y V T H G T P L G N L A AT, RTFRVT. RAT. KT 

I-S4 

4Q SCp 788* AArACnAThGTnGGnGC - > 

Dm ' GTAGCCATTGTGCCAGGCTTGAAGACCATCGTCGG 

751 ♦ ♦ * + — - + - + 825 

VATVPGI, K TIVGAVIESVKNLRD V I 
/\E , I-SS 

Dm ATCCTGACCATGTICTCCCTGTC^^ 

45 826 * ♦ + . - ♦ + + , — + 900 

ILTMFSL SVFALMCLOTYMGVLT E K 

Cm TGCATCAAGAAGTTCCCGCTGGACGGTTCCTGGGG 

901 *- + + — »„_ + + + 97S 

CIKKFPLDGSWGNLTDENWDYHNRN 

Dm AGCTCCAATTGGTATTCCGAGGACGAGGGCATCTCA^ 

50 976 ♦ + ♦ ♦ + + 1050 

SSNWYSEDEGI SFPLCGNISGAGQC 

/\F 
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Hdiothis and Dmvphila sodium channels. start/end of my sequences* _ gap, * same as above. 3/12/92 p2. 
D&K* AAyCCnAAyT AyGGnTAyAC - > 

S F D S F G 

Doyle and Knipple's sequence AGTTTCGATTCATTCGGT 
Ctn GACCACCATTACCTIGTKrCTGCAGGGGT^^ 

1051 * + * . + + U2S 

DDDYVCL0GF3PNPNYCYTSFDSFG 

SCp 1153- < -TACTGnGTyCTrAArACC 

D&K- <-TAC7GnGTVCTrAArACCCTy 
W A F L S A F R _L 

.Dn TC^riTTCCTGTCCGCCTTCr^ 

1126 ♦ + * + + + ♦ . <_ 2 -)-; 

WAFLSArFvLKTQ'DFWEDLYQI. VLRA 

Dm gccggaccatggc^catgctgttctttatat:^ 

1201 ♦ * ♦ ♦ + + 127 5 

A G P W H HLFFIVIIFLGSFYI. VMl.Tf. 

I -So 

Dm GCCATTGTTGCCATGTC(nWIX^CC-A^^ 

1276 — -* — ♦ ♦ ♦ * + + 1350 

A .X .. V AMSYDEL^RKAEEEEAAEEEA I 

Dm CGTGAAGCGGAAGAAGCTGCCGCCGrCAAA^CGGCCAAGCTGGAGG^ 

135! „ . + + . + + 142S 

REAEEAAAAKAAKLEERANAQAQAA 

an GCGGATGOGGCTGCCGCCGAAGAGGriGC^ 

1426 ♦ + * + + f + l500 

ADAAAAEEAALHPEMAKSPTYSC IS 

On TATGAGCTATITGTTGGCCGCCAG^AGGGC.-A~GATC 

1501 ♦ * ♦ + . 157s 

yelfvgge:<g::ddnnkekmsirsve 

Dm CTGGAGTCGGAGTCGGTGAGCGTrATACAAAaACAAC^ 

1576 — ♦ * — - + + . + l6S0 

VESESVSVIQ? QPAPTTAHOATKVR 

Dm AAAGTGAGCACGTACACGATACGGAAr^C%w^ 

1651 * * + * „ 17 -> s 

KVSTYTIR:;G = GRFGIPGSDRKPLV 
Aalt- exon A -5 3 bp 

Do TTGTCAACATATCAGGATGCOCAGCAGC^ 

1726 + ♦ - ♦ + + + - * 1800 

LSTYQDAQQK1PYADDSNAVT PMS E 

GAGAATGGGGCCATCATAGTGCCCGTGTACTATGGCAATCTAGGC^ 

1801 * ^ . * * * + 1 8 7S 

ENGAI IVPVYYGNLCSRHSSYTSHQ 

tta VCCCGAATATCGTATACCTCACAT^ 

1876 ♦ + * -> + -_ * + 19SC 

SRI SYTSHGD^LGGMAVMGVSTMTK 

On GAGAGCAAATTGCGOWCCGCAACAZACGCA^TCAAT^ 

19 51 ♦ «.-— — + _ + + 2025 

esklrnrntr::qsvgatnggttcl d 
on accaatcacaagctcgatcatcgcx^ctacgaaattgg 

2026 ♦ + * * ♦ + ♦ + 2100 

TNHKLDHRDYZIGLECTDEAGKI KH 

&n CATGACAATGCTTTTATCGAGCCCGTCC^ 

2101 ♦ . , + + + 2175 

HDNPFIEPVQ7QTVVDMKDVMVLND 
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Figure 1 



5 Hdiothis and Drosophila sodium channels. — start /end of my sequences* _ gap, " same as above. 3/12/92 p3. 

an ATCATCGAACAGCCCGCTQCTCGGCACACTCGGG 

2176 ♦ v * * * «. * 2250 

I IEOAAGRHSRASCRGVSVYYFPTE 

/\H <— alt exon B — > MI 

Ckn GACGATGACGAGGATGGGCCGACGTTCAAAGACAAGGCACTCGAAGTw^ 

10 2251 ♦ ♦ * * * «. + 232S 

DDDEDG PTFKDKALEVI LKGI D V FC 

On GTGTGT^CTOTTGCTCJGCrmXXrm 

2326 ^ + * * r + * * 2400 

VWDCCWVWLKFQEWVS LIVFDPFVK 

II -SI 

75 

Dm CTCTTCATCttCGCTGrGCATTCTTC^^ 

2401 * * * + . , 2475 

L F I T L C I V _V NTMFMAM 3 HHDMNKEM 

C^^CGCGTGCTCAAGAGTGGCAACTATTrCTTCACCG 
2476 * + + + + * + ^ 2550 

20 E R vl ksghyfftatfa:eatmki. ma 

II -S2 

&T) ATGAGCCCCAAGTACTATTTCCAGGAGGGCTGGAACATCT^ 

2551 * + . . 2525 

MSP K Y Y F Q E GWHTFDFTIVALSLLE 
II-S3 

25 

Dm CTGGGACTCGAGGGTGTCCAGGGTCTGTCX^^ 

2626 ♦ - --♦ * — + * * 2700 

I C L E C V Q G L 5 VLRSFR:, T.RVFKr.AK 
II -S4 /\J 

Cm TCTTGGCCCACACTTAATTTACTCATTTCGATTAT^^ 
30 2701 * ♦ ♦ — - r + + ^ 277s 

S W P T L W LLISIMGRTMGALG N L T F V 



II-SS 

an CTTTCCATTATCATCTTCATCTTT^^ 

2776 4- -* * + A ¥ 

LC IIIFIFAVMGMOLFC K N Y H D H K D 
35 /\K 



40 



45 



28 SO 



50 



On CCKHTTCXXX^TGGCGACCTGCaXXX^^ 

2851 + ♦ * * ♦ f 2925 

RFPDGDLPRWNFTDFMHSFMIVFRV 

On CTCTOCGGAGAATGGATCGAGTXXATGTGGGACTGC^ 

2926 + — ♦ * — * «. 3000 

LCGEWIESHWDCKYVGO V.. SCIPFFL 

II-S6 

an GCCACCGTIXrrCATCGGCAATCTTGTGGTACT^ 

3001 + ♦ + «. + + + 3075 

ATVVIGMLVVLHLFr.A LLLSM F G S S 

an AGCTTATCAGaXXXyvCTGCCG^^ 

3076 — -+ ♦ ♦ * + — _ + + + 3150 

SLSA PTADNDTNKIAEAFNR ig rfk 

an AGTTGGGTTAAGOTAATATTGCTGATTGTTTCAAG^ 

31S1 * + - + f + + + 3225 

SWVKRNIADCFKLIRNKLTNQISDO 
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75 



5 Heiiothis and Dwsophila sodium channels. •** start/end of my sequences. _ gap, " same as above. 3/12/92 p4. 

Dm CCATCAGAGCATCGTC^CAACGAACT^ 

3226 ♦ + ♦ + ♦ , «. ^ 3300 

PSEHGDNELELGHDEILADGLI KKG 
/\ alt. exon E 39 bp 

Cm ATCMGCAGCAGACGCMCTGGAGC^^ 
70 "01 * ♦ ♦ . . 3375 

I KEQTQLEVA igdgmeftihgdmkn 

Dm AACAAGCCGAAGAAATCOVAATATCTAAATAACGCAACG 

3376 + * * + mrron L 

NKPK KSKYLNNAT 

START HSCP1 CLONE 

scd61 pBLS £cafil* * * AATTCACTATaCCAGGTAACTITriTGATACCTA 

scd6l GTTTAAAATAAGATAClVri^ri'ATCTMTAGGATTTTAAGAGTTCT^ 

Scd61 ACAATAAATACAAGAAAGGgcAAAATTTTGTTAMTAATATTAACGCAwt aACAGATAATCATAGAGACAACCGT 

scd6 1 TTAGACTGTGAATTAAATCATCACGGGTATC^^ 

20 HSC 34S5 * ("abelard") AAATCCTACGGCAGT. - . 

DDDTISQKSYGS 

scdol AATCAAGTTTCIX7rACTAAgAACACAATTTCTXTGTTT^ 

Dm GACGACGACACTCCCAGCATTAACTCATATGGTAGC 

incron L + > ♦ 3450 

DDDTASINSVGS 

— CATAA-> no incron 

„ HKIRSFKDESHKGSADTIDG? ? ?KD 

dt> scd61 CATAAAAtCAGGTCGTrcAAAGATGaAAGTCaTAaAGGTTCCGCA^ 

On CATAAGAATCGACCATTCAAGGACGAGAGCCACMGGGC^ 

3451 + * + , + ^ 3S2S 

HKNR PFKDESHKGSAETMEGEEKRD 
Aincron M 



ASKEE LGLEE E. . 
30 scdG 1 GCTaGTAAAGAGGAATTGCXrrTTAGAACAAGG^a&GTCT 

Dm GCCAGCAAGGAGGATn'AGGTCTCGACGAGG 

3526 ♦ + ♦ no intron 

ASKEDLGLDEE-. 

scd61 CCATATTTGGA 

35 ..MVEEEEDGKLDGGLGK 
scd61 Cy\ATTTGCATATAATTAATGTGTTAC£S^AATGGTTGAAGAAGAgG 

to AACTGGACCAGGAGGGCGAATGCGAGGAGCGCCCGCT^ 

♦ ♦ * * * 3600 

**L DEEGECEEGPLDG 



TDIIVAADEEVVDDSPADCCPEPCY 
: 1 ACAGaCATTATAGTGGecGCAG^tGAAGAAGTTGTTGACGAT^ 

GATATCATTATTCATGCACAOGACGAGGATATACTCGATGAATATCC^ 
2601 * ♦ ♦ — ♦ + + + 36?s 

D I I IKAHDEDILDEYPADCCPDSYY 



AKFPFLVGDDESPFWOGWGHLRLKT 
SCd61 GCGAAGtTTCCATrCCTTGTOXTP^ 
On AAGAAATTTCCGATCTTAGCCGCrTCACGATGA 

45 3676 ♦ -+ ♦ * + ♦ + 3750 

KKFPI LAG DDDSPFWQGWGNLR L KT 

FKLI ENTYFETAVITMILLSSLAL 
scd61 TTCAAACTCATTGAGAACACATATTTcCAAACGGCTGra 
ttn rTTCGATTAATTCAGGATAAATATTTTGAAACAGCTGT^ 

3751 ♦ ♦ + — + ♦ + +-- no intron 

50 F R LIEDKYFETAVTTMTLM Sfit. AT. 

III-Sl 
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Hdiothis and Drosophila sodium channels. *** start/end of my sequence*, _ gap, " same as above. 3/12/92 p5- 
scd61 ACTTCTCAAATAA 

A 

scd61 TTTTCTGAAcAc 1 IT 1 1 ; l'l N^Ac ATAGT AA.GGGAGcAAATT ATCTTCATGACGAAACTTry kCTGTCTTTACA£CCT 

^ CCA 

no intron 3825 

A 



scdfil 
Dm 

3326 



hscp3868- 
9Cp 39754 

sedfil 
Cm 

3901 



LEDVNL PHRP I LQDI LYYHDR I F T V 

rrAGAACATCTAAATTTACCACATCGACCC^rr^ 

TTAGAAGATXTTACATCTGCCACAMGACCCATACTGC^GGATATTTTATACT 

4. 4. + + + ^ + ^ ,(j 

LEDVHLPQRPIL QD ILYYMDRTFTV 

III-S2 

<-CCACGACC..- 

-4229+/3905+* ACnAAyGCnTGGTGyTGG-> 
IF" F I_E MLI KWLAL-jP O K Y F T N A W C W 
ATITTCTICATCGAGATGTTGATCA 
ATATTCTrcTTGGAAATGTTAATC^ACT^^^ 



I F F I, E H L... 



G F K V Y L T N A W C W 
III -S3 



hscp386B 

scd61 
Dm 

3976 



CAGCTGAAGTAGTA 

L D F I I V M 

CTCGACTTCATCATTGTCATGSI^TATTACTATAAATA^ 
CTCGATTTCGTGATTGTCATG 

+ incron :.* 

L D F V T V M 



scd6 1 TGCAGATTAGATTCGTAAAACCrTAGATTATg ATTATaGJLATTT^^ 

scd6 1 AAATrAAgGTTCAGGTCGGTCTTTGAAGT^^ 

scd6 1 TACAGTIxnTAAGTATACCTAGATCTAAGGAATAT 

SCd61 GTCGCTcGTAAACCACgGnCCACTGATgsTGGcg sGACgACATCCCnGTTCcGCTCGAtGcACgTTGngngcGCT 

SCd61 GCGGcTCCGc^XXXTTCHXTTCGCTg^^ 

£Cd61 TGGTATCGTTCXXXrTGCACATCCACACGATTG^ 

SCd61 GCTAATCGAAATAAGCAACCAAACXTTC^ 

scd6l TTCATATTCAACTMTCTCTTMTAAACrriTGTTCT 

scd61 ACTCAAAGCGTGCAGCrn^GATTOT 

scd6l AgcTCTCGAT GTCGT ATIVlT : OTl'l'ATGCAAAcCGCTTCAc GTGACTCTATAACCCACGACCcccGCTATATGA 

scd61 ATACCTGTGrCCCrrATATATaAPAACCTcCACAGAGTr^ 

scd61 ACtTTCTPCTATCTTOrrGCTtrr^ 

scd6l GATAAAATtXXyVsATTTGAAAGAAAACXTTTTAAM 

SC<36 1 ACAAACCAAACTATAAATACCTATACTATATAC^^ 

scd6 1 ATTTTAgGGACATCAGAGAAACGCTACCCTA^ 

SCd61 TATATAATT ATTTCCATXTC^TTrATTCTCTGATG tyC/ymAAr kwAinyTCGATGTAACCITATCriX3TAACTTX^ 

scdSl GTGAATATCACGTPCCTATCCCTCTGATTATOCIXX^ 

scdfil TTCTTTATAGTATCATATCCTTAGGTTTCrrA*** 

GAP IN HSCP SEQUENCE 



0TATCGC77ATCAAC 



intron N 



^TTCACTTGTTGGAGCTGGTGGTATTCAAGCX: 



S L T N " V A S L V G A G G I Q A 

III-S4 



Dm TTCJVAGACTATGCGAACGTTAAGAGCACTGAGACCACr7A 

4051 ♦ ♦ + no incron 

F K T M R T L P A L 3 P T = 



HSC 4116+ "Fred" TGAGCCGCATGCAGGGCATG- > 

HSC 4105- * jenny" < -ACGTCCCGTACTCCCATGCA 

AHSRMQGMR intron? 

scd72* * *ATTAGCGTTCAAAAGCGATGCGAAGCrcX^ 

GCCATCTXTCCGTATGCACGGCATGAGG 

no intron + + intron O 

■ A M fi R M O G M R 
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Heiiothis and Drosopkilm sodium channels — start /end of my sequences, _ gap, " same as above. 3/12/92 p6. 

SCd72 ACCA CCCTTOCTOCCG ACMC ACCCT 

sed72 TTCTATTTCAACTTCTACGATCATTT^^ 

scd72 tCGATATTTCTGCATsMTCACCGGftTCAAAATTTGTTT^ 

scd72 ACTACTCGATTTJVAGTAATr^ 

scd72 CTGTTCGTCATAAACCTCTrTTACAAAGGC 

scd72 rrrt CTATTGGTCTTATTATTAGCTAGATTAGACTTTTGTAATTACTTAG^ 

scd72 GCACCTTAGATnTTTCTrCTTGTATCTIt^TCGA* * * 



GAP IN HSCP SEQUENCE 



75 



scdl31 



* GCTAACTGCTACATACTn'ACTGCACAtrrATTAATCACA 



scd 131 TTAACGTCCTTATATCCCAACTAATAATGCGCCCACT 

P20ro4/ll ** C 

P20f4/ll 



20 



scdl31 CAGTACTT 

P20m4/ll * " T " GTGGGT ACCT ACACCCA 

P20f4/ll ------c-- 

Dm 



V V 

.CCAATATATCCTTCGTGACCAtTTCTACT 



GTCGTC 
intron O + 4125 

V V 



25 



30 



HSC 4211* CTGATCTTC. 

VNALVQAI PSIFNVLLVCLIFWLIF 
scdl3l GTAAACGCTKTTCCTrGCAACCGATCCCXTrOCA 

P20m4/ll 

P20f4/ll 
Cm 

4126 

VNALVQAI PS1 ? N 





(TrTAATGCGCTGGTACAAGCTATACCGTCCATC 




GTCTGTCTAATATTTTGGCTAATT 


TTT 



Y...L, U. 
III-S5 



VCLT FWLTF 



35 



4211+ — GCCATCA1XjGG-> 

HSC 4235* "4215*- ACAACTt7TTCGCTGGnAAA?A- > 

RRO 84 CAAA7ATTTCAACCTA TTAAT > 

SSO 8* AAAAT ATTTCAAGGTAAGCAG - > 
A IMGVQLFAGKYFK 

sodl 3 1 GCCATCATGGGAGTACAACTGTTCGCTGGC^ TTAATTTATTAACATAACAAAAA 

P20m4/U G 

P20£4/ll G * A 

Plm24/9 *** a 

Dm GCCT^TAATGGGTGTACAGCTTTTTGCTGGAAAATATTTTAAG 

4201 ♦ *♦ +-- intron P 

A IMGVOLFAC K V F K 



'AGCAGTA* *GT- 
•AGCAGTA"CT- 



HSCO 52- < -TAGAATAATCA . 
scdl3l AATATTTCAATTCGTAAAATCTTAT7AGT 
Plm24/9 - 



45 



scd!3l 

PHH24/9 

Dm 



.GACAAGTTTTA 
GTGTTCAAAATTTCTAACATl 
C 1 



CVDLNHTTLSH 
TTT^-TCTAJJTX^CGTCGACXTCAACCACACGACG^^ 



TGCGAGGACATCAATGGCACGAAGCTCAGCCAC 

ir.ircn P + ♦ + 4275 

CEDMNGTKLSH 



HSCP4343+ TCCCJvCAACTCACCGATCAACTT-> 
HSC 4325+ (4335+) ATCTTAGAGAACTACACCTGGGA-> 

E I I PDRNACI LENYTWEMSPMNFDH 
scdl31 GAAATCATCCCAGACCGGAATCCGTGCATCTTAG 

Plra24/9 G A 

Dm GAGATCATACXIAAATOGCAATGCCTGCXjAGAGCXjAGA 

4276 + + ♦ * + + + * 4350 

E I IPNRNACESENYTWVNSAMNFDH 
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HHiothis and Drosophila sodium channels. *** start/end of my sequences, _ gap, " same as above. 3/12/92 p7. 

HSC 4394- "Heloise" < -TCCCTACCTATGTCTAGTAC 

HSC 4415+ -4665+* AGGGATGGAT ACAGATCATGAA - > 
HSC 4399- 'Liz' < -ACCTATGTCTAGTACTTGCTGCG 

VGKAYLCLFQVATFKGWIQIMNDAI 
SCdUl GTCGGCAAGGOCTATCTXriXXXTIXTriCCAAGTGGCCA 
Plm24/9 *** 

Cm GTAGCrrAACGCGTATCTGTCCCTmCCAA^ 

4351 *- 7 ♦ ♦ ♦-- — * «■ + 442S 

VGNAYLCLFQVATFKCWIQIMNDAI 



D S R E 

scdl3l GATTCGAGAGAAG^AJGGCTACTATTrerm 
Dm CATTCACCAGAG 

4426 ir-cr-- ? 

75 D S R E 



scdl31 ACGCAATACAATAAATGATAT 



VGRQPIRETNIYMYLYFVFFI 
scdl3l TGTTAATGCCAGGTGGGCCGGCAACCTATACG 
20 C* 11 GTGGACAAGCAACCAATTXGTGAAACGAACATCTAC^^ 

intron Q + * + ♦ * ♦ 4500 

VDKQPIRETN IYMY LYFVFFI 

III-S6 



IFGSFFTLNLFIGVIIDNFNEQKKK 
scd!31 ATATITGGCTCATTCTTCACTCTC^ 
25 Dm ATATTTO^TCATTTTTCACACTCAATCTG^ 

4501 + + + ♦ + ♦ ♦ 457S 

IFGSFFTLNLFIGVT T P NFNEQKKK 



A GSLEMFMTEDQKKYY N A M K K M G S 

scdl31 aT _ _GCX^VGCcTIXAGATGTTCATGA 
Dm GCAGGTGGATCATTAGAAATGTTCATGACAGAAGATCAG 

4576 + + + * + + + + 4650 

AGGSLEMFM T E D Q K K Y Y S A M K K M G S 
PKC acciv'n site West et al Science 254, 866 

K K P L K A I PRPK? 
scdl31 AAAAAAOCTITAAAAGCTATCCCGAGACCGAAGG 
On AAAAAACCATTAAAAOCCATTCCAAGACCAAGG 

4651 + + ♦ intron R 

KKPLKA1 PRPR 



scdl31 TATtX3UtfjGAGGAGCGACTC TTATA TITG 

scdl31 C^GTACTGCCCTAAACCATTGTTrrCAACGCCA^ 

scdl31 GMTCTATICTAAGATTGATTTG<^GCCGTCC^ 

scdl31 CGTTCTAOCCAGATAAGAAAGCTTTCTACC 



WRPQAIVFEIVTDK 
TAAATAAACACTAACTGAAACTGTTTGTTCCAG^^ 
Cm TCGCGACCACAAGCAATAGTCTTTGAAATAGTAACCGATAAG 

intron R ♦ + + + 4725 

^ WRPQAIVFE T V T D K 

IV -SI 

KFDMIIMOFIGLNMLTMTLDHYQQS 
scdl31 AAGTraaCATGATCAiraT^^ 
□n AAATTCGATATMTCATTATGTTATTCATTGCTCTGAA^ 

4726 + + ♦ ♦ ♦— — + + + 4800 

50 KFDTTTMf. FTC t, NMFTHTT.n R Y D A S 



55 



15 



EP 0 615 976 A1 



FIGURE 1 



Hetiothis and Drosophila sodium channels. *** start /end of my sequences, _ gap, " same as above. 3/12/92 p8. 
HSC 4834- (8/10/90) < -TACTATAAGTAGCACTATAAGTC 

ETFSTVLDYLNHIFIVI FSSECLLK 
scdl31 GAGAOTTCAGCACTGTCCTCGACTACCT^ 
Cm GACAOTATAACGCGGTCCTAGACTATCTCM 

4801 + ♦ + — + + 4875 

D T Y N A VLDYLNA7PVVT FSSECL T. K 
IV-S2 



MFALRYHY F V EPWNLFDFVVVNFSI 
scdl31 ATGTIXXCCTTACGCTACCATTACT1TGTTGAGCCATGGA 
Cln ATATTCTXTTTTACGATATCACTATTTTATTGAGa^ 

4876 + ♦ ♦ + + + ♦ ♦ 495C 

I F A L R Y H Y F I EPWN LFDVVVV ILSI 

IV -S3 



L S. . 

scdl31 CTTAS33360^ATTTTXa3GTCTCCTO 
On TTAG 

4951 intron S 



SCpu 4991+ (5246+) T3 tAThGAr AArTAyTTyGT - > 

-.LVLSDI I EKYFVS PTL L R V V R V A 

TCTTTGTTCTCaCPTTTGGTATTGAG^ 

Cm GTCTTCTACTTAGCGATATTATCGACAA 

intron S ♦ + + ♦ + + + 5025 

..LVLSDI IE KYFVSPTLL R V V R V A 

IV-S4 



HSC 5097+ (5350+1 TTC?TTCC»«CTCGCCAT-> 
HSC 5083- < - AAGCGCCACCGGTACAGTGA 

HSC 5095- <-TACAGT. . . 

KVGRVLRLVKGAKGI RTLLFGLAMS 
scdl31 AAGGTCGGTCGTGTGTTGCGTCTCtrr^ 
Cm AAAGTGGGCCGTGTCCTTCGACTGGTGA^ 

5026 + ♦ ♦ *• + ♦ + + 5100 

KVGRVLRLVKR AK GIR T LLFALAMS 



HSC5095- .GACGGTCGGAATAA 

SCpu 5169+ (5426+) T3+GCnAThTTyGGnATG-> 

SCpu 5143- (5430-/5218-1 <-TACAArTAdAArCGnTA&.M13 . -40 

LPALFNICLLLFL VMFIFAIFGHSF 
scdl3l CnTXXAGCCTTATTCAACATCTGTCTC 
Dm CTGCCGGCCCTGTTCAACATCTGCCTG^ 

Sl 01 + + + + + ^ Sl75 

L P A L F H iCLLLFLVMPTFATFK MfiF 
IV-S5 



FMHVKDKGGLDDV VNFKTFVQSMI L 
scdl31 TTTATGCACGTC^AAGACAAAGCTGGTCTC^ 
Pm TTCATGCACGTGAAGGAGAAGAGCGGCATTAACGACGTCTA 

5176 + + — + * +-- + + + 5250 

F M H V K EKSGINDV VNFKTFGQSMIL 



L F Q 

scdl31 CTATTTCAGGJX2*iGTGTrACT 
Dm CTCTTTCAG 

5251 intron T 

L F Q 



scdl31 GCAGGGTTTATTCGTTCAAATTGAAACrrTAATm 
scdl31 ATCAAACACAGAGTGAGGTTGTTAATTTATGTGTT 
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FIGURE 1 



Hdiothis and DrosophOa sodium channels. *** start/end of my sequences, _ gap, " same as above. 3/12/92 p9. 
SCpu528S+(5540+/5269+> T3 +TCnGCnGGnTGGGAyGG- > 

MSTSAGWDGVLDGIINEEECDL 

scdl31 GTTTGTC&^TGTCGACGTQJGCC^ 

Cm ATGTCGACGTCAGCCGGTTGGGATGGTGTACTGGAOGCCATT 

intron T + + + + + + + 5325 

MSTSAGWDGVLDAIINEEACDP 



PDNERGYPGNCGSATIGITYLLSYL 
scdl31 CCCCACAACGACCGCGGCTACCCCC^^CT^^ 
an CCCGACAACGACAAAGGCTATCCGGGCAATTGTGGTTCA 

5326 + + ♦ + + * ♦ + 5400 

PDNDKGYPGNCGSATV GITFLL5YL 

IV-S6 

SCpu i>425- (57 12-) <-TTrTACATrTAdCGnCAgagtgaccggcagcaaa 

VISFLIVINMYIAVILENYSQAS* 
scdl 3 1 GTCATCTC CTICCT CATCGTCATCAACA 
Cm GTrTATAAGCTTTTTGATAGTTATTAATATCTA 

5401 + + + + + __ 5461 

VISFLTV IHMYIAVT^ E N G I 
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FIGURE 2 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: American Cyan amid Company 

(ii) TITLE OF INVENTION: Method for Monitoring Pesticide 
Reels tance 

w (iii) NUMBER OF SEQUENCES: 10 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: American Cyanamid Company 

(B) STREET: One Cyanamid Plaza 

(C) CITY: Wayne 

(D) STATE: New Jersey 
*5 (E) COUNTRY: USA 

(F) ZIP: 07470-8426 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

20 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1-0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: EP 93 118 061.6 

(B) FILING DATE: 08-NOV-1993 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Wachtershauser Dr., Gunter 
(C) REFERENCE/ DOCKET NUMBER: EA-9088/31 , 732 

(ix> TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (089)293906 

(B) TELEFAX: (089)223759 

(C) TELEX: 5214173 Patw-D 



25 



40 



50 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2416 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO:l: 

AATTCACTAT ACCAGGTAAC TTTTTGATAC CTAGTTTAAA ATAAGATACT GTTGTTATCT 60 

AATAGGATTT TAAGAGTTGT CATAAACGTA ATGTTAATTT TTCAGGCGAC AATAAATACA 120 

AGAAAGGGCA AAATTTTGTT AAATAATATT AACGCAWTAA CAGATAATCA TAGAGACAAC 180 

CGTTTAGACT GTGAATTAAA TCATCACGGG TATCCTATAC AGGTAAATAT TTGTCGTCAC 240 

AGCTTKCTAA TAAATCACAA TCAAGTTTCT GTACTAAGAA CACAATTTCT CGTTTAGGAT 300 



55 
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GACGATACAA TTAGTCAAAA ATCGTACGGC 
AGTCATAAAG GTTCCGCAGA CACGATAGAT 

5 GAATTGGGTT TAGAAGAAGG TCAGTGTAAA 

AAGCCATATT TGGACAATTT GCATATAATT 
AAGATGGGAA GTTAGACGGA GGTCTAGGCA 

10 AAGTTGTTGA CGATAGCCCT GCTGACTGCT 

TCCTTGTGGG TGATGATGAA TCTCCCTTTT 
CCTTCAAACT CATTGAGAAC ACATATTTCG 

75 GTAGTTTGGC TTTGGTAAGT TCTCAAATAA 

GGAGCAAATT ATGTTCATGA CGAAACTTYK 
TTACCACATC GACCGATTCT TCAAGATATC 
ATTTTCTTCA TCGAGATGTT GATCAAATGG 

20 

AATGCGTGGT GCTGGCTCGA CTTCATCATT 
CTTTCGTATC ATTTGAACTA ACAGTTTCCT 
GATTATGGAA TTTGAACTTG TAAGTTCTGT 

25 

TCGGTCTTTG AAGTTTATCC TGCCGCCTCT 
TACAGTGTTA AGTATACCTA GATGTAAGGA 
GGTGTGGCGT TAGTTGTCGC TCGTAAACCA 

30 CNGTTCCGCT CGATGCACGT TGNGNGCGCT 

CATGCGCGTG AGTAGGACGG CACACCACTC 
GCACATCCAC ACGATTGTTT CACTCTACTT 

35 GCTAATCGAA ATAAGCAACC AAACCTTCCG 

AAATGTGAAC TCTGATTCAT ATTCAACTAA 
CTAATTCTAC TGTGTTTGAC GTGCAGCGCA 

40 GTCTATGGCA GTGGAAACTC CGAACGGCCT 

ATTGTTTGTT TATGGAAACC GCTTCATGTG 
ATACCTGTGR CCGTATATAT AAAAACCTCC 

45 AGTGCATGAA ACAACACGTC TTCTATCTTT 

GTACTACTCA CATTACCCAC ATCTGTCGGG 
TAAAACTGAA AATGGCACGT GATGTTGGTT 

50 AATACCTATA CTATATACAT ATCTTTGATA 

ATTTTAGGGA CATCAGAGAA ACGGTAGCCT 



AGTCATAAAA TCAGGTCGTT CAAAGATGAA 360 

GGCGAHGHGH MGAAGGACGC TAGTAAAGAG 420 

ACTGCAATTN AAAATTAACA GAATTGAACT 480 

AATGTGTTAC AGAAATGGTT GAAGAAGAGG 540 

AAACAGACAT TATAGTGGCC GCAGATGAAG 600 

GTCCAGAGCC ATGTTACGCG AAGTTTCCAT 660 

GGCAAGGCTG GGGCATGCTT CGGTTGAAAA 720 

AAACGGCTGT GATTACAATG ATTTTGCTCA 780 

TTTTCTGAAC ACTTTGTTTC ACATAGTAAG 840 

CTGTCTTTAC AGGGTTTAGA AGATGTAAAT 900 

TTGTATTATA TGGATCGGAT CTTCACCGTC 960 

CTTGCCCTTG GCTTCCAGAA ATACTTCACA 1020 

GTCATGGTAA TATTACTATA AATATATTTG 1080 

TGCAGATTAG ATTGGTAAAA CGTAGATTAT 1140 

ATAATGTGAA AGACAAAATT AAGGTTCAGG 1200 

CAGCGAGGTA AAGCTGGGAA GAATAATTTA 1260 

ATATATTGTA TACTAAAGTA AATGACGATT 1320 

CGGNGCAGTG ATGSTGGCGS GACGACATCC 1380 

GCGGCTCCGC GCGGTCTCTC GCTGGGAGGG 1440 

GTGCGCAGGC TGTGTTGGTA TCGTTGCGCT 1500 

TCTGCTGAGA AATCAGTGCA ACATGGTGTT 1560 

ACAGAGATTT TTATCTCGAA CCACTTTGTG 1620 

TCTCTTAATA AAGTTTGTTG TAAATATTTT 1680 

ACTCAAAGCG TGCAGCTTTG ATTGTTCGAT 1740 

CACCTCGCTG CCTCGAGCTC TCGATGTCGT 1800 

ACT CT ATAAC CCACGACCCC CGCTATATGA 1860 

ACAGAGTGAC TTGAAATCCT TATACTTTCA 1920 

GTGCTGTTGT GCGAGATAGT GCGTTTTCAC 1980 

GATAAAATCC GASATTTGAA AGAAAAGCTT 2040 

GCTGTCGATG TCATTACAAA GCAAACTATA 2100 

TTTGTTCTTA ATATGATGTG ATGTAGCTTT 2160 

AAGCTCAAAA TTAGAGCTTT TTGTAAAATC 2220 
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AATCCTGTTA 


ATTGCTATAT 


AATTATTTCC 


ATTTCTTTTA 


TTCTCTGATG 


KYCYYMAARK 


2280 




WAMYTCGATG 


TAACCTTATG 


TGTAACTTGA 


GTGAATATCA 


CGTTCCTATC 


CCTCTGATTA 


2340 


5 


TGCTGCAATA 
TATCCTTAGG 


GGAACTTCTG 
TTTGTA 


TTTCCAAATG 


AATCTTGAGA 


TTTTCTTCTT 


TATAGTATCA 


2400 
2416 




(2) INFORMATION FOR SEQ ID NO: 2: 










10 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 567 base pairs 

(B) TYPE: nucleic acid 

(C) ST HANDEDNESS : single 

(D) TOPOLOGY: linear 








15 


(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 










ATTAGCGTTC 


AAAAGCGATG 


CGAAGCTGGG 


ACTGCGCTCT 


CAGGCCATGA 


GCCGCATGCA 


60 


20 


GGGCATGAGG 


GTACGTACCA 


CCCTGTGCTG 


CCGACAACAC 


CCTATCGCTC 


ATCCATCCAC 


120 




CACACACTTC 


GCTCCACACT 


TCACATTCAC 


ATTTCTATTT 


CAACTTCTAC 


GATCATTTTT 


180 




TAACATTTTA 


AAATTTCCAA 


CGTRCCAGCC 


GTACTMGGGC 


TCCTTTTTTC 


GATATTTCTG 


240 


25 


CATSAATCAC 


CGGATCAAAA 


TTTGTTTTTA 


ATAGTTAATT 


TGGACAGTTA 


TCCGATTCAT 


300 




TGGCAGTAGT 


CGATTGAAGT 


AATTATTAGT 


GAATCATTTT 


GAAGTGGTCG 


GTGGCACCCC 


360 




TGAATGGCTT 


AGTATCATCA 


CTGTTCGTCA 


TAAACCTCTT 


TTAGAAAGGG 


TCAATGGGAT 


420 


30 


TTATTGTGGA 


GAGATATTYR 


TCCATGTTTT 


GGTCTCTTTT 


CTATTGGTCT 


TATTATTAGC 


480 




TAGATTAGAC 


TTTTGTAATT 


ACTTAGTTAT 


TTGGAATGCT 


AATTTATA^T 


CTGCACCTTAj 


540 




GATTTTTTCT 


TCTTGTATCT 


TCATCGA 








567 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2279 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



50 



GCTAACTGCT 


ACATAGTTAC 


TGCACAGTAT 


TAATGACATT 


AACGTCCTTA 


TATCCCAACT 


60 


AATAATGCGC 


CCACTAACAA 


ATGCACGCCA 


TTGATATAAG 


AAAGGAGACG 


TATCAGTACT 


120 


TCCAATATAT 


CCTTCGTGAC 


CAGTGTAGTA 


ATACGTACGT 


ATGTGACAGG 


TGGTGGTAAA 


180 


CGCTCTCGTG 


CAAGCGATCC 


CGTCCATCTT 


CAACGTGTTG 


TTGGTGTGTC 


TTATCTTCTG 


240 
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10 



75 



20 



25 



30 



50 



55 



GCTGATCTTC GCCATCATGG GAGTACAACT GTTCGCTGGC AAATATTTCA AGGTATTAAT 300 

TTATTAACAT AACAAAAAAA TATTTCAATT CGTAAAATCT TATTAGTGTG TTCAAAATTT 360 

CTAACATGTT TTTCTTTGTT CTGTTCTAGT GCGTCGACCT CAACCACACG ACGTTGAGCC 420 

ACGAAATCAT CCCAGACCGG AATGCGTGCA TCTTAGAGAA CTACACCTGG GAGAACTCAC 480 

CGATGAACTT TGACCATGTC GGCAAGGCGT ATCTCTGCCT GTTCCAAGTG GCCACCTTCA 540 

AGGGATGGAT ACAGATCATG AACGACGCTA TTGATTCGAG AGAAGTATGG CTACTATTTC 600 

TTTTCCTTTT GTTCATAAGT TCATAAATTA ATATCAATAA AAATATCACG CAATACAATA 660 

AATGATATTG TTAATGCCAG GTGGGCCGGC AACCTATACG CGAGACGAAC ATCTACATGT 720 

ACCTGTACTT CGTGTTCTTC ATCATATTTG GCTCATTCTT CACTCTCAAC CTATTCATCG 780 

GTGTGATCAT CGACAACTTT AACGAACAGA AGAAGAAAGC CGGCGGCAGC CTTGAGATGT 840 

TCATGACTGA GGACCAGAAG AAATACTACA ATGCCATGAA GAAAATGGGT TCTAAAAAAC 900 

CTTTAAAAGC TATCCCGAGA CCGAAGGTAA CAGACGATTG CATTGTTTTT TGACCTCAAT 960 

GGAAACATAT CCAAGGAGGA GCGAGTCTTA TATTTGAAAC TTGATAGTAA TATTGTTGTA 1020 

TATTTTATAA TTTCATAAAC AGCAGT ACT G CGGTAAACCA TTGTTTTCAA CGCCAGAAAC 1080 

TGCAGGACGT TTAATTATTG AGGGATGATT TTGCCTAGAA TCTATTCTAA GATTGATTTG 1140 

GAGCCGTCCA CTTCCCAACG ACAGTTGCAG CATCTATGCC ACCGGACCAC GTCGTTGTAC 1200 

C CAG AT AAG A AAGCTTTCTA CCTAAATAAA CACTAACTGA AACTGTTTGT TCCAGTGGCG 1260 

G C CACAAGCG ATCGTGTTCG AGATAGTGAC GGACAAGAAG TTCGACATGA TCATCATGTT 1320 

GTTCATCGGC CTCAACATGT TGACGATGAC GCTCGATCAC TACCAGCAGT CGGAGACCTT 1380 

CAGCACTGTC CTCGACTACC TCAACATGAT ATTCATCGTG ATATTCAGTT CAGAGTGCCT 1440 

ATTAAAAATG TTCGCCTTAC GCTACCATTA CTTTGTTGAG CCATGGAACT TGTTCGATTT 1500 

CGTAGTAGTC AATTTCTCAA TTCTTAGTGA GTATTTTGGG TCTCCTGTTA TTCCAATAGT 1560 

AAAGTGTTTT CCATTTATAA TTTACTAATG ATACACTCTC TTTGTTCTCA GGTTTGGTAT 1620 

TGAGTGATAT TATAGAAAAA TATTTTGTGT CACCCACGTT ACTGAGGGTG GTGAGAGTAG 1680 

CGAAGGTCGG TCGTGTGTTG CGTCTCGTGA AGGGTGCGAA GGGTATCCGG ACGTTATTGT 1740 

TCGGGCTGGC CATGTCACTG CCAGCCTTAT TCAACATCTG TCTGCTGCTG TTCCTTGTGA 1800 

TGTTCATCTT CGCCATCTTC GGCATGTCGT TCTTTATGCA CGTCAAAGAC AAAGGTGGTC 1860 

TCGACGACGT GTACAACTTC AAGACCTTCG TGCAGAGTAT GATCCTGCTA TTTCAGGTCA 1920 

GTGTTACTAA TCATACTTTA GCGCCTCCTG GTTGCTTGAG GATGAATGAC CACAAGCAAC 1980 

CAGCAGGGTT TATTCGTTCA AATTGAAAGT TAATTTTTAG CCGTT CAAGC ATCTAGTGTA 2040 

TGCTAATCTG TCTTATCGTT TGTCAGATGT CGACGTCNGC CGGCTGGGAC GGCGTGCTGG 2100 

AC GG CAT CAT CAACGAGGAG G AGTGC GAN C TGCCGGACAA CGAGCGCGGC TACCCCGGCA 2160 
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ACTGCGGCTC TGCNACCATC GGCATCACCT ACCTGCTGTC CTACCTCGTC ATCTCCTTCC 2220 
TCATCGTCAT CAACATGTAC ATCGCCGTCA TTCTCGAGAA TTACTCGCAG GCAAGTTGA 2279 
5 (2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 196 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
w (D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



75 



20 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Asp Asp Asp Thr lie Ser Gin Lys Ser Tyr Gly Ser His Lys lie Arg 
15 10 15 

Ser Phe Lys Asp Glu Ser His Lys Gly Ser Ala Asp Thr lie Asp Gly 
20 25 30 

Xaa Xaa Xaa Lys Asp Ala Ser Lys Glu Glu Leu Gly Leu Glu Glu Glu 
35 40 45 

Met Val Glu Glu Glu Glu Asp Gly Lys Leu Asp Gly Gly Leu Gly Lys 
50 55 60 

Thr Asp He He Val Ala Ala Asp Glu Glu Val Val Asp Asp Ser Pro 
65 70 75 80 

Ala Asp Cys Cys Pro Glu Pro Cys Tyr Ala Lys Phe Pro Phe Leu Val 
85 90 95 

30 Gly Asp Asp Glu Ser Pro Phe Trp Gin Gly Trp Gly Met Leu Arg Leu 

100 105 110 

Lys Thr Phe Lys Leu He Glu Asn Thr Tyr Phe Glu Thr Ala Val He 
115 120 125 

35 Thr Met He Leu Leu Ser Ser Leu Ala Leu Ala Leu Glu Asp Val Asn 

130 135 140 

Leu Pro His Arg Pro He Leu Gin Asp He Leu Tyr Tyr Met Asp Arg 
145 150 155 160 



40 



He Phe Thr Val He Phe Phe He Glu Met Leu He Lys Trp Leu Ala 
165 170 175 

Leu Gly Phe Gin Lys Tyr Phe Thr Asn Ala Trp Cys Trp Leu Asp Phe 
180 185 190 

He He Val Met 
45 195 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 9 amino acids 
50 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 5: 

Ala Met Ser Arg Met Gin Gly Met Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 452 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: protein 



(XX ) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



10 



20 



Val Val Val Asn Ala Leu Val Gin Ala lie Pro Ser lie Phe Asn Val 
15 10 15 

Leu Leu Val Cys Leu He Phe Trp Leu He Phe Ala lie Met Gly Val 
20 25 30 

Gin Leu Phe Ala Gly Lys Tyr Phe Lys Cys Val Asp Leu Asn His Thr 
25 35 40 45 

Thr Leu Ser His Glu He He Pro Asp Arg Asn Ala Cys He Leu Glu 
50 55 60 

Asn Tyr Thr Trp Glu Asn Ser Pro Met Asn Phe Asp His Val Gly Lys 
30 65 70 75 80 

Ala Tyr Leu Cys Leu Phe Gin Val Ala Thr Phe Lys Gly Trp He Gin 
65 90 95 

He Met Asn Asp Ala He Asp Ser Arg Glu Val Gly Axg Gin Pro He 
35 100 105 110 

Arg Glu Thr Asn He Tyr Met Tyr Leu Tyr Phe Val Phe Phe He He 
115 120 125 



40 



50 



Phe Gly Ser Phe Phe Thr Leu Asn Leu Phe He Gly Val He He Asp 
130 135 140 

Asn Phe Asn Glu Gin Lys Lys Lys Ala Ala Gly Ser Leu Glu Met Phe 
145 150 155 160 

Met Thr Glu Asp Gin Lys Lys Tyr Tyr Asn Ala Met Lys Lys Met Gly 
165 170 175 

Ser Lys Lys Pro Leu Lys Ala He Pro Arg Pro Lys Trp Arg Pro Gin 
180 185 190 

Ala He Val Phe Glu He Val Thr Asp Lys Lys Phe Asp Met He He 
195 200 205 

Met Leu Phe He Gly Leu Asn Met Leu Thr Met Thr Leu Asp His Tyr 
210 215 220 



55 



24 



EP 0 615 976 A1 



75 



20 



25 



Gin Gin Ser Glu Thr Phe Ser Thr Val Leu Asp Tyr Leu Asn Met He 
225 230 235 240 

Phe He Val He Phe Ser Ser Glu Cys Leu Leu Lys Met Phe Ala Leu 
245 250 255 

Arg Tyr His Tyr Phe Val Glu Pro Trp Asn Leu Phe Asp Phe Val Val 
260 265 270 

Val Asn Phe Ser He Leu Ser Leu Val Leu Ser Asp He He Glu Lys 
275 280 285 

Tyr Phe Val Ser Pro Thr Leu Leu Arg Val Val Arg Val Ala Lys Val 
290 295 300 

Gly Arg Val Leu Arg Leu Val Lys Gly Ala Lys Gly He Arg Thr Leu 
305 310 315 320 

Leu Phe Gly Leu Ala Met Ser Leu Pro Ala Leu Phe Aen He Cys Leu 
325 330 335 

Leu Leu Phe Leu Val Met Phe He Phe Ala He Phe Gly Met Ser Phe 
340 345 350 

Phe Met His Val Lys Asp Lys Gly Gly Leu Asp Asp Val Tyr Asn Phe 
355 360 365 

Lys Thr Phe Val Gin Ser Met He Leu Leu Phe Gin Met Ser Thr Ser 
370 375 380 

Ala Gly Trp Asp Gly Val Leu Asp Gly He He Asn Glu Glu Glu Cys 
385 390 395 400 

Asp Leu Pro Asp Asn Glu Arg Gly Tyr Pro Gly Asn Cys Gly Ser Ala 
405 410 415 

Thr He Gly He Thr Tyr Leu Leu Ser Tyr Leu Ala Ala Val He Ser 
420 425 430 

Phe Leu He Val He Aen Met Tyr He Ala Val He Leu Glu Asn Tyr 
435 440 445 

Ser Gin Ala Ser 
450 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5461 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATGACAGAAG ATTCCGACTC GATATCTGAG GAAGAACGCA GTTTGTTCCG TCCCTTTACC 60 

CGCGAATCAT TGGTGCAAAT CGAACAACGC ATTGCCGCTG AACATGAAAA GCAGAAGGAG 120 

CTGGAAAGAA AGAGAGCCGA GGGAGAGGTG CCGCGATATG GTCGCAAGAA AAAACAAAAA 180 
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GAAATCCGAT ATGATGACGA GGACGAGGAT 
CAGGGTGTGC CAATACCTGT TCGATTGCAG 

5 CCTCTCGAGG ATATCGATCC CTACXACAGC 

GGAAAAGATA TTTTTCGCTT TTCTGCATCA 
CCGATACGTC GTGTGGCCAT TTACATTCTA 

w ACCACAATTC TCGT CAACTG CATCCTGATG 

ACTGAGGTGA TATTCACCGG AATCTACACA 
GGTTTCATTT TATGCCCGTT TACGTATCTT 

75 GTAATAGCTT TAGCTTATGT GACCATGGGT 

ACGTTTAGGG TGCTGCGAGC GCTTAAAACC 
GTCGGCGCCG TCATCGAATC GGTGAAGAAT 
TCCCTGTCGG TGTTCGCGTT GATGGGCCTA 

20 

TGCATCAAGA AGTTCCCGCT GGACGGTTCC 
TATCACAATC GCAATAGCTC CAATTGGTAT 
TGCGGCAATA TATCCGGTGC GGGGCAATGC 

25 

GGTCCGAATC CGAATTATGG CTACACCAGC 
GCCTTCCGGC TGATGACACA GGACTTCTGG 
GCCGGACCAT GGCACATGCT GTTCTTTATA 

30 

GTGAATTTGA TTTTGGCCAT TGTTGCCATG 
GAAGAAGAGG CTGCCGAAGA GGAGGCGATA 
GCGGCCAAGC TGGAGGAGCG GGCCAATGCG 

35 GCCGAAGAGG CTGCACTGCA TCCGGAAATG 

TATGAGCTAT TTGTTGGCGG CGAGAAGGGC 
ATTCGGAGCG TCGAGGTGGA GTCGGAGTCG 

^0 ACCACAGCAC ACCAAGCTAC CAAAGTTCGT 

CGTGGCCGCT TTGGTATACC CGGTAGCGAT 
GATGCCCAGC AGCACTTGCC CTATGCCGAC 

45 GAGAATGGGG CCATCATAGT GCCCGTGTAC 

TATACCTCGC ATCAGTCCCG AATATCGTAT 
GCCGTCATGG GCGTCAGCAC AATGACCAAG 

5Q AATCAATCAG TGGGCGCCAC CAATGGCGGC 

GATCATCGCG ACTACGAAAT TGGCCTGGAG 



GAAGGTCCAC AACCGGATCC TACACTTGAA 240 

GGCAGCTTCC CGCCGGAATT GGCCTCCACT 300 

AATGTACTGA CATTCGTAGT TGTAAGCAAA 360 

AAAGCAATGT GGATGCTCGA TCCATTCAAT 420 

GTGCATCCAT TATTTTCCCT ATTCAT CATC 480 

ATAATGCCGA CAACGCCCAC GGTTGAGTCC 540 

TTTGAATCAG CTGTTAAAGT GATGGCACGA 600 

AGAGATGCAT GGAATTGGCT GGACTTCGTA 660 

ATAGATTTAG GTAATCTAGC AGCCCTGCGA 720 

GTAGCCATTG TGCCAGGCTT GAAGACCATC 780 

CTGCGCGATG TGATTATCCT GACCATGTTC 840 

CAGATCTATA TGGGCGTGCT CACCGAGAAG 900 

TGGGGCAATC TGACCGACGA GAACTGGGAC 960 

TCCGAGGACG AGGGCATCTC ATTTCCGTTA 1020 

GACGACGATT ACGTGTGCCT GCAGGGGTTT 1080 

TTCGATTCGT TCGGATGGGC TTTCCTGTCC 1140 

GAGGATCTGT ACCAGCTGGT GTTGCGCGCC 1200 

GTCAT CATCT TCCTAGGTTC ATTCTATCTT 1260 

TCGTATGACG AATTGCAAAG GAAGGCCGAA 1320 

CGTGAAGCGG AAGAAGCTGC CGCCGCCAAA 1380 

CAGGCTCAGG CAGCAGCGGA TGCGGCTGCC 1440 

GCCAAGAGTC CGACGTATTC TTGCATCAGC 1500 

AACGATGACA ACAACAAAGA GAAGATGTCC 1560 

GTGAGCGTTA TACAAAGACA ACCAGCACCT 1620 

AAAGTGAGCA CGTACACGAT ACGGAACGGA 1680 

CGTAAGCCAT TGGTATTGTC AACATAT CAG 1740 

GACTCGAATG CCGTCACCCC GATGTCCGAA 1800 

TATGGCAATC TAGGCTCCCG ACACTCATCG 1860 

ACCTCACATG GCGATCTACT CGGCGGCATG 1920 

GAGAGCAAAT TGCGCAACCG CAACACACGC 1980 

ACCACCTGTC TGGACACCAA TCACAAGCTC 2040 

TGCACGGACG AAGCTGGCAA GAT T AAACAT 2100 
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CATGACAATC CTTTTATCGA GCCCGTCCAG 
ATGGTCCTGA ATGACATCAT CGAACAGGCC 

5 GGTGTCTCCG TTTACTATTT CCCAACAGAG 

GACAAGGCAC TCGAAGTGAT CCTCAAAGGC 
TGGGTTTGGT TGAAATTT C A GGAGTGGGTA 

w CTCTTCATCA CGCTGTGCAT TGTGGTCAAC 

ATGAACAAGG AGATGGAACG CGTGCTCAAG 
GCCATCGAGG CCACCATGAA GCTAATGGCC 
TGGAACATCT TCGACTTCAT TATCGTGGCC 

75 

GTCCAGGGTC TGTCCGTATT GCGTTCCTTT 
TCTTGGCCCA CACTTAATTT ACTCATTTCG 
AATCTGACAT TTGTACTTTG CATTATCATC 

20 

TTCGGAAAGA ATTAT CATGA TCACAAGGAC 
AACTTCACCG ACTTTAT GCA CAGCTTCATG 
ATCGAGTCCA TGTGGGACTG CATGTACGTG 

25 

GCCACCGTTG TCATCGGCAA TCTTGTGGTA 
AATTTTGGCT CATCTAGCTT ATCAGCGCCG 
GAGGCCTTCA ATCGAATTGG C C G ATTT AAA 

30 

TTCAAGTTAA TACGTAACAA ATTGACAAAT 
GACAACGAAC TGGAGCTGGG CCACGACGAG 
ATCAAGGAGC AGACGCAACT GGAGGTGGCC 

35 GGCGACATGA AGAACAACAA GCCGAAGAAA 

GACACTGCCA GCATTAACTC ATATGGTAGC 
CACAAGGGCA GCGCCGAGAC GATGGAGGGC 

40 TTAGGTCTCG ACGAGGAACT GGACGAGGAG 

GATATCATTA TTCATGCACA CG AC GAGGAT 
CCCGATTCGT ACTATAAGAA ATTTCCGATC 

45 CAAGGATGGG GCAATTTACG ACTGAAAACT 

ACAGCTGTTA TCACTATGAT TTTAATGAGT 
CTGCCACAAA GACCCATACT GCAGGATATT 

50 ATATTCTTCT TGGAAATGTT AATCAAGTGG 

AACGCGTGGT GTTGGCTCGA TTTCGTGATT 



ACACAAACGG TGGTT GAT AT GAAAGATGTG 2160 

GCTGGTCGGC ACAGTCGGGC AAGCGATCGC 2220 

GACGATGACG AGGATGGGCC GACGTTCAAA 2280 

ATCGATGTGT TTTGTGTGTG GGACTGTTGC 2340 

TCGCTCATCG TCTTCGATCC CTTCGTCGAG 2400 

ACGATGTTCA TGGCAATGGA TCACCACGAT 2460 

AGTGGCAACT ATTT CTTCAC CGCCACCTTT 2520 

ATGAGCCCCA AGTACTATTT CCAGGAGGGC 2580 

CTATCGCTAT TGGAACTGGG ACTCGAGGGT 2640 

CGATTGCTGC GTGTATTCAA ACTGGCCAAG 2700 

ATTATGGGAC GCACCATGGG CGCTTTGGGT 2760 

TTCATCTTTG CGGTGATGGG AATGCAACTG 2820 

CGCTTTCCGG ATGGCGACCT GCCGCGCTGG 2880 

ATCGTGTTCC GGGTGCTCTG CGGAGAATGG 2940 

GGCGATGTCT CGTGCATTCC CTTCTTCTTG 3000 

CTTAACCTTT TCTTAGCCTT GCTTXTGTCC 3060 

ACTGCCGATA ACGATACGAA TAAAATAGCC 3120 

AGTTGGGTTA AG CGT AATAT TGCTGATTGT 3180 

CAAATAAGTG ATCAACCATC AGAGCATGGT 3240 

ATCCTCGCCG ACGGCCTCAT CAAGAAGGGG 3300 

ATCGGGGATG GCATGGAATT CACGATACAC 3360 

TCCAAATATC TAAATAACGC AACGGACGAC 3420 

CAT AAGAAT C GACCATTCAA GGACGAGAGC 3480 

GAGGAGAAGC GCGACGCCAG CAAGGAGGAT 3540 

GGCGAATGCG AGGAGGGCCC GCTCGACGGT 3600 

ATACTCGATG AATAT CCAGC TGATTGCTGC 3660 

TTAGCCGGTG ACGATGACTC GCCGTTCTGG 3720 

TTTCGATTAA TTGAGGATAA ATATTTTGAA 3780 

AGCTTAGCTT TGGCATTAGA AGATGTACAT 3840 

TTATACTATA TGGACAGAAT ATTTACGGTT 3900 

TTGGCGCTCG GCTTCAAAGT GTACTTGACC 3960 

GT CATGGT AT CGCTTATCAA CTTCGTTGCT 4020 
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TCACTTGTTG GAGCTGGTGG TATTCAAGCC TTCAAGACTA TGCGAACGTT AAGAGCACTG 4080 

AGACCACTAC GTGCCATGTC CCGTATGCAG GGCATGAGGG TCGTCGTTAA TGCGCTGGTA 4140 

5 CAAGCTATAC CGTCCATCTT CAATGTGCTA TTGGTGTGTC TAATATTTTG GCTAATTTTT 4200 

GCCATAATGG GTGTACAGCT TTTTGCTGGA AAATATTTTA AGTGCGAGGA CATGAATGGC 4260 

ACGAAGCTCA GCCACGAGAT CATAC CAAAT CGCAATGCCT GCGAGAGCGA GAACTACACG 4320 

10 TGGGTGAATT CAGCAATGAA TTTCGATCAT GTAGGTAACG CGTATCTGTG CCTTTTCCAA 4380 

GTGGCCACCT TCAAAGGCTG GATACAAATC ATGAACGATG CTATCGATTC ACGAGAGGTG 4440 

GACAAGCAAC CAATTCGTGA AACGAACATC TACATGTATT TATATTTCGT ATTCTTCATC 4500 

?5 ATATTTGGAT CATTTTTCAC ACTCAATCTG TTCATTGGTG TTATCATTGA TAATTTTAAT 4560 

GAGCAAAAGA AAAAAGCAGG TGGATCATTA GAAATGTTCA TGACAGAAGA TCAGAAAAAG 4620 

TACTATAGTG CTATGAAAAA GATGGGCTCT AAAAAACCAT TAAAAGCCAT TCCAAGACCA 4680 

AGGTGGCGAC CACAAGCAAT AGTCTTTGAA ATAGTAACCG ATAAGAAATT CGATATAATC 4740 

20 

ATTATGTTAT TCATTGGTCT GAACATGTTC ACCATGACCC TCGATCGTTA CGATGCGTCG 4800 

GACACGTATA ACGCGGTCCT AGACTATCTC AATGCGATAT TCGTAGTTAT TTTCAGTTCC 4860 

GAATGTCTAT TAAAAATATT CGCTTTACGA TATCACTATT TTATTGAGCC ATGGAATTTA 4920 

25 

TTTGATGTAG TAGTTGTCAT TTTATCCATC TTAGGTCTTG TACTTAGCGA TATTATCGAG 4980 

AAGTACTTCG TGTCGCCGAC CCTGCTCCGA GTGGTGCGTG TGGCGAAAGT GGGCCGTGTC 5040 

CTTCGACTGG TGAAGGGAGC CAAGGGCATT CGGACACTGC TCTTCGCGTT GGCCATGTCG 5100 

30 

CTGCCGGCCC TGTTCAACAT CTGCCTGCTG CTGTTCCTGG T CAT GTT CAT CTTTGCCATT 5160 

TTCGGCATGT CGTTCTTCAT GCACGTGAAG GAGAAGAGCG GCATTAACGA CGTCTACAAC 5220 

TTCAAGACCT TTGGCCAGAG CATGATCCTG CTCTTTCAGA TGTCGACGTC AGCCGGTTGG 5280 

35 

GATGGTGTAC TGGACGCCAT TATCAATGAG GAAGCATGCG ATCCACCCGA CAACGACAAA 5340 

GGCTATCCGG GCAATTGTGG TTCAGCGACC GTTGGAATAA CGTTTCTCCT CTCATACCTA 5400 

GTTATAAGCT TTTTGATAGT TATTAATATG TACATTGCTG TCATTCTCGA GAACGGAATT 5460 

4° C 5461 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1820 amino acids 
45 (B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

55 
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Met Thr Glu Asp Ser Asp Ser lie Ser Glu Glu Glu Arg Ser Leu Phe 
1 5 10 15 

Arg Pro Phe Thr Arg Glu Ser Leu Val Gin He Glu Gin Arg lie Ala 
20 25 30 

Ala Glu His Glu Lys Gin Lys Glu Leu Glu Arg Lys Arg Ala Glu Gly 
35 40 45 

Glu Val Pro Arg Tyr Gly Arg Lys Lys Lys Gin Lys Glu lie Arg Tyr 
50 55 60 

Asp Asp Glu Asp Glu Asp Glu Gly Pro Gin Pro Asp Pro Thr Leu Glu 
65 70 75 80 

Gin Gly Val Pro lie Pro Val Arg Leu Gin Gly Ser Phe Pro Pro Glu 
85 90 95 

Leu Ala Ser Thr Pro Leu Glu Asp lie Asp Pro Tyr Tyr Ser Asn Val 
100 105 110 

Leu Thr Phe Val Val Val Ser Lys Gly Lys Asp lie Phe Arg Phe Ser 
115 120 125 

Ala Ser Lys Ala Met Trp Met Leu Asp Pro Phe Asn Pro lie Arg Arg 
130 135 140 

Val Ala lie Tyr lie Leu Val His Pro Leu Phe Ser Leu Phe lie He 
145 150 155 160 

Thr Thr He Leu Val Asn Cys He Leu Met He Met Pro Thr Thr Pro 
165 170 175 

Thr Val Glu Ser Thr Glu Val He Phe Thr Gly He Tyr Thr Phe Glu 
180 185 190 

Ser Ala Val Lys Val Met Ala Arg Gly Phe He Leu Cys Pro Phe Thr 
195 200 205 

Tyr Leu Arg Asp Ala Trp Asn Trp Leu Asp Phe Val Val He Ala Leu 
210 215 220 

Ala Tyr Val Thr Met Gly He Asp Leu Gly Asn Leu Ala Ala Leu Arg 
225 230 235 240 

Thr Phe Arg Val Leu Arg Ala Leu Lys Thr Val Ala He Val Pro Gly 
245 250 255 

Leu Lys Thr He Val Gly Ala Val He Glu Ser Val Lys Asn Leu Arg 
260 265 270 

Asp Val He He Leu Thr Met Phe Ser Leu Ser Val Phe Ala Leu Met 
275 280 285 

Gly Leu Gin He Tyr Met Gly Val Leu Thr Glu Lys Cys He Lys Lys 
290 295 300 

Phe Pro Leu Asp Gly Ser Trp Gly Asn Leu Thr Asp Glu Asn Trp Asp 
305 310 315 320 

Tyr His Asn Arg Asn Ser Ser Asn Trp Tyr Ser Glu Asp Glu Gly He 
325 330 335 

Ser Phe Pro Leu Cyo Gly Asn He Ser Gly Ala Gly Gin Cys Asp Asp 
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340 345 350 

Asp Tyr Val Cys Leu Gin Gly Phe Gly Pro Asn Pro Asn Tyr Gly Tyr 
355 360 365 

Thr Ser Phe Asp Ser Pho Gly Trp Ala Phe Leu Ser Ala Phe Arg Leu 
370 375 380 

Met Thr Gin Asp Phe Trp Glu Asp Leu Tyr Gin Leu Val Leu Arg Ala 
385 390 395 400 

Ala Gly Pro Trp His Met Leu Phe Phe lie Val lie He Phe Leu Gly 
405 410 415 

Ser Phe Tyr Leu Val Asn Leu He Leu Ala He Val Ala Met Ser Tyr 
420 425 430 

Asp Glu Leu Gin Arg Lys Ala Glu Glu Glu Glu Ala Ala Glu Glu Glu 
435 440 445 

Ala He Arg Glu Ala Glu Glu Ala Ala Ala Ala Lys Ala Ala Lys Leu 
450 455 460 

20 Glu Glu Arg Ala Asn Ala Gin Ala Gin Ala Ala Ala Asp Ala Ala Ala 

465 470 475 480 

Ala Glu Glu Ala Ala Leu His Pro Glu Met Ala Lys Ser Pro Thr Tyr 
485 490 495 



15 



25 



Ser Cys He Ser Tyr Glu Leu Phe Val Gly Gly Glu Lys Gly Asn Asp 

500 505 510 

Asp Asn Asn Lys Glu Lys Met Ser He Arg Ser Val Glu Val Glu Ser 

515 520 525 

Glu Ser Val Ser Val He Gin Arg Gin Pro Ala Pro Thr Thr Ala His 

30 530 535 540 

Gin Ala Thr Lys Val Arg Lys Val Ser Thr Tyr Thr He Arg Asn Gly 

545 550 f 555 560 

Arg Gly Arg Phe Gly He Pro Gly Ser Asp Arg Lys Pro Leu Val Leu 

35 565 570 575 

Ser Thr Tyr Gin Asp Ala Gin Gin His Leu Pro Tyr Ala Asp Asp Ser 

580 585 590 



40 



45 



50 



Asn Ala Val Thr Pro Met Ser Glu Glu Asn Gly Ala He He Val Pro 

595 600 605 

Val Tyr Tyr Gly Asn Leu Gly Ser Arg His Ser Ser Tyr Thr Ser His 

610 615 620 

Gin Ser Arg He Ser Tyr Thr Ser His Gly Asp Leu Leu Gly Gly Met 

625 630 635 640 

Ala Val Met Gly Val Ser Thr Met Thr Lys Glu Ser Lys Leu Arg Asn 

645 650 655 

Arg Asn Thr Arg Asn Gin Ser Val Gly Ala Thr Asn Gly Gly Thr Thr 

660 665 670 

Cys Leu Asp Thr Asn His Lys Leu Asp His Arg Asp Tyr Glu He Gly 

675 680 685 
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Leu Glu Cya Thr Asp Glu Ala Gly Lys lie Lys His His Asp Asn Pro 
690 695 700 

Phe lie Glu Pro Val Gin Thr Gin Thr Val Val Asp Met Lys Asp Val 
705 710 715 720 

Met Val Leu Asn Asp lie lie Glu Gin Ala Ala Gly Arg His Ser Arg 
725 730 735 

Ala Ser Asp Arg Gly Val Ser Val Tyr Tyr Phe Pro Thr Glu Asp Asp 
740 745 750 

Asp Glu Asp Gly Pro Thr Phe Lys Asp Lys Ala Leu Glu Val He Leu 
755 760 765 

Lys Gly He Asp Val Phe Cys Val Trp Asp Cya Cya Trp Val Trp Leu 
770 775 780 

Lys Phe Gin Glu Trp Val Ser Leu He Val Phe Asp Pro Phe Val Glu 
785 790 795 800 

Leu Phe He Thr Leu Cys He Val Val Asn Thr Met Phe Met Ala Met 
805 810 815 

Asp His His Asp Met Asn Lys Glu Met Glu Arg Val Leu Lys Ser Gly 
820 825 830 

Asn Tyr Phe Phe Thr Ala Thr Phe Ala He Glu Ala Thr Met Lys Leu 
835 840 845 

Met Ala Met Ser Pro Lys Tyr Tyr Phe Gin Glu Gly Trp Asn He Phe 
650 855 860 

Asp Phe He He Val Ala Leu Ser Leu Leu Glu Leu Gly Leu Glu Gly 
865 870 875 880 

30 Val Gin Gly Leu Ser Val Leu Arg Ser Phe Arg Leu Leu Arg Val Phe 

885 890 895 

Lys Leu Ala Lys Ser Trp Pro Thr Leu Asn Leu Leu He Ser He Met 
900 905 910 



75 



20 



25 



35 



Gly Arg Thr Met Gly Ala Leu Gly Asn Leu Thr Phe Val Leu Cys He 
915 920 925 

He He Phe He Phe Ala Val Met Gly Met Gin Leu Phe Gly Lys Asn 
930 935 940 

Tyr His Asp His Lys Asp Arg Phe Pro Asp Gly Asp Leu Pro Arg Trp 
40 945 950 955 960 

Asn Phe Thr Asp Phe Met His Ser Phe Met He Val Phe Arg Val Leu 
965 970 975 

Cys Gly Glu Trp He Glu Ser Met Trp Asp Cys Met Tyr Val Gly Asp 
45 980 985 990 

Val Ser Cys He Pro Phe Phe Leu Ala Thr Val Val He Gly Asn Leu 
995 1000 1005 



50 



Val Val Leu Asn Leu Phe Leu Ala Leu Leu Leu Ser Asn Phe Gly Ser 
1010 1015 1020 

Ser Ser Leu Ser Ala Pro Thr Ala Asp Asn Asp Thr Asn Lys He Ala 
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1025 1030 1035 1040 

Glu Ala Phe Asn Arg lie Gly Arg Phe Lye Sar Trp Val Lys Arg Asn 
1045 1050 1055 

He Ala Asp Cyo Phe Lys Leu He Arg Asn Lye Leu Thr Asn Gin He 
1060 1065 1070 

Ser Asp Gin Pro Ser Glu His Gly Asp Asn Glu Leu Glu Leu Gly His 
1075 10B0 1085 

Asp Glu He Leu Ala Asp Gly Leu He Lys Lys Gly He Lys Glu Gin 
1090 1095 1100 

Thr Gin Leu Glu Val Ala lie Gly Asp Gly Met Glu Phe Thr He His 
1105 1110 1115 1120 

Gly Asp Met Lys Asn Asn Lys Pro Lys Lys Ser Lys Tyr Leu Asn Asn 
1125 1130 1135 

Ala Thr Asp Asp Asp Thr Ala Ser He Asn Ser Tyr Gly Ser His Lys 
1140 1145 1150 

Asn Arg Pro Phe Lys Asp Glu Ser His Lys Gly Ser Ala Glu Thr Met 
1155 1160 1165 

Glu Gly Glu Glu Lys Arg Asp Ala Ser Lys Glu Asp Leu Gly Leu Asp 
1170 1175 1180 

Glu Glu Leu Asp Glu Glu Gly Glu Cys Glu Glu Gly Pro Leu Asp Gly 
1185 1190 1195 1200 

Asp He He He His Ala His Asp Glu Asp He Leu Asp Glu Tyr Pro 
1205 1210 1215 

Ala Asp Cys Cys Pro Asp Ser Tyr Tyr Lys Lys Phe Pro He Leu Ala 
1220 1225 1230 

Gly Asp Asp Asp Ser Pro Phe Trp Gin Gly Trp Gly Asn Leu Arg Leu 
1235 1240 1245 

Lys Thr Phe Arg Leu He Glu Asp Lys Tyr Phe Glu Thr Ala Val He 
1250 1255 1260 

Thr Met He Leu Met Ser Ser Leu Ala Leu Ala Leu Glu Asp Val His 
1265 1270 1275 1280 

Leu Pro Gin Arg Pro He Leu Gin Asp He Leu Tyr Tyr Met Asp Arg 
1285 1290 1295 

He Phe Thr Val He Phe Phe Leu Glu Met Leu He Lys Trp Leu Ala 
1300 1305 1310 

Leu Gly Phe Lys Val Tyr Leu Thr Asn Ala Trp Cys Trp Leu Asp Phe 
1315 1320 1325 

Val He Val Met Val Ser Leu lie Asn Phe Val Ala Ser Leu Val Gly 
1330 1335 1340 

Ala Gly Gly He Gin Ala Phe Lys Thr Met Arg Thr Leu Arg Ala Leu 
1345 1350 1355 1360 

Arg Pro Leu Arg Ala Met Ser Arg Met Gin Gly Met Arg Val Val Val 
1365 1370 1375 
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Asn Ala Leu Val Gin Ala lie Fro Ser lie Phe Asn Val Leu Leu Val 
1380 1385 1390 

Cys Leu lie Phe Trp Leu He Phe Ala He Met Gly Val Gin Leu Phe 
1395 1400 1405 

Ala Gly Lys Tyr Phe Lys Cys Glu Aep Met Asn Gly Thr Lys Leu Ser 
1410 1415 1420 

His Glu He lie Pro Asn Arg Asn Ala Cys Glu Ser Glu Asn Tyr Thr 
1425 1430 1435 1440 

Trp Val Asn Ser Ala Met Asn Phe Asp His Val Gly Asn Ala Tyr Leu 
1445 1450 1455 

Cys Leu Phe Gin Val Ala Thr Phe Lys Gly Trp He Gin He Met Asn 
1460 1465 1470 

Asp Ala He Asp Ser Arg Glu Val Asp Lys Gin Pro He Arg Glu Thr 
1475 1480 1485 

Asn He Tyr Met Tyr Leu Tyr Phe Val Phe Phe He He Phe Gly Ser 
1490 1495 1500 

Phe Phe Thr Leu Asn Leu Phe He Gly Val He He Asp Asn Phe Asn 
1505 1510 1515 1520 

Glu Gin Lys Lys Lys Ala Gly Gly Ser Leu Glu Met Phe Met Thr Glu 
1525 1530 1535 

Asp Gin Lys Lys Tyr Tyr Ser Ala Met Lys Lys Met Gly Ser Lys Lys 
1540 1545 1550 

Pro Leu Lys Ala He Pro Arg Pro Arg Trp Arg Pro Gin Ala He Val 
1555 1560 1565 

Phe Glu He Val Thr Asp Lys Lys Phe Asp He He He Met Leu Phe 
1570 1575 1580 

He Gly Leu Asn Met Phe Thr Met Thr Leu Asp Arg Tyr Asp Ala Ser 
1585 1590 1595 1600 

Asp Thr Tyr Asn Ala Val Leu Asp Tyr Leu Asn Ala He Phe Val Val 
1605 1610 1615 

He Phe Ser Ser Glu Cys Leu Leu Lys He Phe Ala Leu Arg Tyr His 
1620 1625 1630 

Tyr Phe He Glu Pro Trp Asn Leu Phe Asp Val Val Val Val He Leu 
1635 1640 1645 

Ser He Leu Gly Leu Val Leu Ser Asp He He Glu Lys Tyr Phe Val 
1650 1655 1660 

Ser Pro Thr Leu Leu Arg Val Val Arg Val Ala Lys Val Gly Arg Val 
1665 1670 1675 1680 

Leu Arg Leu Val Lys Gly Ala Lys Gly He Arg Thr Leu Leu Phe Ala 
1685 1690 1695 

Leu Ala Met Ser Leu Pro Ala Leu Phe Asn He Cys Leu Leu Leu Phe 
1700 1705 1710 

Leu Val Met Phe He Phe Ala He Phe Gly Met Ser Phe Phe Met His 
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1715 1720 1725 

Val Lys Glu Lys Ser Gly He Asn Asp Val Tyr Asn Phe Lys Thr Phe 
1730 1735 1740 

Gly Gin Ser Met He Leu Leu Phe Gin Met Ser thr Ser Ala Gly Trp 
1745 1750 1755 1760 

Asp Gly Val Leu Asp Ala He He Asn Glu Glu Ala Cys Asp Pro Pro 
1765 1770 1775 

Asp Asn Asp Lys Gly Tyr Pro Gly Asn Cys Gly Ser Ala Thr Val Gly 
1780 1785 1790 

He Thr Phe Leu Leu Ser Tyr Leu Val He Ser Phe Leu He Val He 
1795 1800 1805 

Asn Met Tyr He Ala Val He Leu Glu Asn Gly He 
1810 1815 1820 

(2) INFORMATION FOR SEQ ID NO: 9: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ATGAGCCGCA TGCAGGGCAT GAGGGTACGT ACCACCCTGT GCTGCCGACA ACACCCTATC 60 

GCTCATCCAT CCACCACACA CTTCGCTCCA CACTTCACAT TCACATTTCT ATTTCAACTT 120 

CTACGATCAT TTTTTAACAT TTTAAAATTT CCAACGTRCC AGCCGTACTM GGGCTCCTTT 180 

TTTCGATATT TCTGCATSAA TCACCGGATC AAAATTTGTT TTTAATAGTT AATTTGGACA 240 

GTTATCCGAT TCATTGGCAG TAGTCGATTG AAGTAATTAT TAGTGAATCA TTTTGAAGTG 300 

GTCGGTGGCA CCCCTGAATG GCTTAGTATC ATCACTGTTC GTCATAAACC TCTTTTAGAA 360 

AGGGTCAATG GGATTTATTG TGGAGAGATA TTYRTCCATG TTTTGGTCTC TTTTCTATTG 420 

GTCTTATTAT TAGCTAGATT AGACTTTTGT AATTACTTAG TTATTTGGAA TGCTAATTTA 480 

T ATT CTGCAC CTTAGATTTT TTCTTCTTGT ATCTTCATCG A 521 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 568 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



50 



(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 





GCTAACTGCT 


ACATAGTTAC 


TGCACAGTAT 


TAATGACATT 


AACGTC CTTA 


TATCCCAACT 


60 


5 


AATAATGCGC 


CCACTAACAA 


ATGCACGCCA 


TTGATATAAG 


AAAGGAGACG 


TATCAGTACT 


120 




TCCAATATAT 


CCTTCGTGAC 


CAGTGTAGTA 


ATACGTACGT 


ATGTGACAGG 


TGGTGGTAAA 


180 




CGCTCTCGTG 


CAAGCGATCC 


CGTCCATCTT 


CAACGTGTTG 


TTGGTGTGTC 


TTATCTTCTG 




10 


GCTGATCTTC 


GCCATCATGG 


GAGTACAACT 


GTTCGCTGGC 


AAATATTTCA 


AGGTATTAAT 


300 




TTATTAACAT 


AACAAAAAAA 


TATTTCAATT 


CGTAAAAT CT 


TATTAGTGTG 


TTCAAAATTT 


360 




CTAACATGTT 


TTTCTTTGTT 


CTGTTCTAGT 


GCGTCGACCT 


CAACCACACG 


ACGTTGAGCC 


420 


15 


ACGAAATCAT 


CCCAGACCGG 


AATGCGTGCA 


TCTTAGAGAA 


CTACACCTGG 


GAGAACTCAC 


480 




CGATGAACTT 


TGACCATGTC 


GGCAAGGCGT 


ATCTCTGCCT 


GTTCCAAGTG 


GCCACCTTCA 


540 




AGGGATGGAT 


ACAGATCATG 


AACGACGC 








568 



20 

Claims 

1. An isolated nucleic acid fragment comprising a nucleic acid sequence encoding a non-dipteran sodium 
25 channel, or portion thereof. 

2. The fragment of Claim 1 in which the channel is either lepidopteran, coleopteran or homopteran. 

3. The fragment of Claim 2 which is lepidopteran. 

30 

4. The fragment of Claim 3 which is derived from Heliothis , Helicoverpa or Spodoptera . 

5. The fragment of Claim 4 which is derived from Heliothis virescens , Heliothis armigera , or Helicoverpa 
zea. 

35 

6. The fragment of Claim 1 which hybridizes with a nucleic acid sequence depicted in Figure 1 under 
medium or high stringency conditions. 

7. The fragment of Claim 1 which comprises all or a portion of the sequence depicted in Figure 1. 

40 

a The fragment of Claim 1 which is capable of being used as a probe to detect RFLPs in an insect 
population comprising both pyrethroid sensitive and pyrethroid resistant individuals. 

9. The fragment of Claim 1 which is detectably labelled. 

45 

10. An isolated nucleic acid fragment deposited with the American Type Culture Collection under Acces- 
sion No. 75334. 

11. A vector comprising the fragment of Claim 1 . 

50 

12. A host cell comprising the vector of Claim 11. 
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